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Preface 


The aim of this book is to provide a readable account of the examples and fundamental 
results of groups from a theoretical point of view and a geometrical point of view. 
The concept of a group is one of the most fundamental in modern mathematics. 
Groups are systems consisting of a set of elements and a binary operation that can 
be applied to two elements of the set, which together satisfy certain axioms. These 
require that the group is closed under the operation (the combination of any two 
elements produces another element of the group), that it obey the associative law, 
that it contains an identity element (which, combined with any other element, leaves 
the latter unchanged), and that each element have an inverse (which combines with 
an element to produce the identity element). 

Since the book does not make any assumptions about the reader’s background, it 
is suitable for newcomers to group theory and even those who never studied algebra. 
To explain many subjects we used figures and images to help the readers. 

The book is organized into eleven chapters. To get to any depth in group theory 
requires set theory, combinatorics, number theory, matrix theory, and geometry. Thus, 
not only we have devoted Chaps. 1-2 to the introductory concepts of set theory, 
combinations, number theory, and symmetry, but we have also devoted some parts 
of Chap. 7 to matrix theory. Although the main subject of the book is groups but in 
many parts we will need some information about other algebraic structures, like rings, 
fields and vector spaces. In Chap. 3, we give the definitions of group and subgroup, 
examples and some elementary properties. Several excellent and important examples 
of groups like cyclic groups, permutation groups, group of arithmetical functions, 
matrix groups and linear groups are investigated in Chaps. 4—7. In Chap. 8, Lagrange’s 
theorem as the most important theorem in finite groups is discussed. Chapters 9-11 
address the normal subgroups, factor groups, derived subgroup, homomorphism, 
isomorphism and automorphism of groups. A consequence is Cayley’s theorem in 
which every group could be realized as a permutation group. 

Some chapters or sections are labelled as optional; this means that the readers can 
ignore them for the first study. Each section ends in a collection of exercises. The 
purpose of these exercises is to allow students to test their assimilation of the material, 
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to challenge their knowledge and ability. Moreover, at the end of each chapter we 
have two special sections: (1) Worked-Out Problems; (2) Supplementary Exercises. 

In each worked-out problems section, I decided to try teach by examples, by 
writing out to solutions to problems. In choosing problems, three major criteria have 
been considered, to be challenging, interesting and educational. Moreover, there are 
many exercises at the end of each chapter as supplementary exercises. They are 
harder than before and serve to present the interesting concepts and theorems which 
are not discussed in the text. 

The list of references at the end of the book is confined to works actually used in 
the text. 


Yazd, Iran Bijan Davvaz 
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Chapter 1 M®) 
Preliminaries Notions Cheak fr 


The concept of a set is used in various disciplines, particularly in group theory. In 
this chapter we review elementary set theory. The concepts of relations, partitions, 
functions, and ordered sets are presented after a discussion of the algebra of sets. 
Many problems in group theory require that we count the number of ways that a 
particular event can occur, so we study the topics of permutations and combinations. 
Moreover, much of group theory involves properties of integers. For this reason the 
last section deals with the properties of integers. We collect the ones we need for 
future reference such as divisibility, prime numbers, and congruences. 


1.1 Sets and Equivalence Relations 


The notion of set is so simple that it is usually introduced informally and regarded as 
self-evident. By a set we mean a collection of objects characterized by some defining 
property that allows us to think of the objects as a whole entity. The defining property 
has to be such that it must be clear whether the given objects belong to the set or 
not. The objects possessing the property are called elements or members of the set. 
We denote sets by common capital letters A, B, C, etc. and elements of the sets by 
lower case letters a, b,c, etc. If S is a set (whose elements maybe numbers or any 
other objects) and x is an element of S, then we write x € S. If it so happens that x 
is not an element of S, then we write x ¢ S. For a property p and an element x of a 
set S, we write p(x) to indicate that x has the property p. A set may be defined by 
a property. The notation A = {x € S | p(x)} indicates that the set A consists of all 
elements x of S having the property p. The empty set is the set having no elements 
which will be denoted by the symbol 9. In set theory there are several fundamental 
operations for constructing new sets from given sets. 
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2 1 Preliminaries Notions 


Definition 1.1 Suppose that A and B are two sets. We say A is a subset of B and 
we write A C B if every element of A is an element of B. 


If A C B, then we also say A is included in B or that B includes A. Thus, A = B 
if and only if A C B and B C A. Note that any set is a subset of itself. At the other 
extreme, @ is a subset of every set. 


Definition 1.2 Let X be a set and A be a subset of X. 


(1) The union of A and B, denoted by A U B, is the set of all elements x such that 
x belongs to at least one of the two sets A or B. 

(2) The intersection of A and B, denoted by A M B, is the set of all elements x which 
belongs to both A and B. 

(3) The difference set of A and B, denoted by A \ B, is the set of all elements x 
which belongs to A and not in B. 

(4) The complement of A respect to X refers to elements of X not in A and denoted 
by AS. 


Two sets are said to be disjoint sets if they have no elements in common. Equiv- 
alently, two disjoint sets are sets whose intersection is the empty set. 


Definition 1.3 A set which contains only finitely many elements is called a finite 
set. A set is infinite if it is not finite. 


The number of elements in a finite set A is denoted by | A|. The study of operations 
of union, intersection and difference, together with inclusion relation, goes with the 
name of algebra of sets. The following identities, which holds for any sets, are some 
of the elementary facts of the algebra of sets. It is routine to check that the following 
identities, for any three sets A, B, and C: 


(1) AUB= BUA, (6) AN(BUC) = (ANB)U(AN 
(2) ANB=BNA, ©); 
(3) AU(BUC) =(AUB)UC, (7) (AU BY = ACN BS, 
(4) AN(BNC) =(ANB)NC, (8) (AN BY’ = AC UBS. 
(5) AU(BNC) =(AUB)N(AU 
C), 


The equalities (7) and (8) are called De Morgan’s Laws. Let A and B be two sets. 
The set of all ordered pair (a, b) witha € Aandb € B iscalled the Cartesian product 
of A and B and is denoted by A x B. Insymbols A x B = {(a,b)|aeA, be B}. 
For the ordered pair (a, b), a is called the first coordinate and b is called the second 
coordinate. For any three sets A, B, and C , we have 


(1) Ax (BNC) =(Ax B)N(Ax ©), (2) Ax (BUC) =(Ax B)U(AxC). 
The simplest numbers are the positive numbers, |, 2, 3, and so on, used for count- 


ing. These are called natural numbers. The set of natural numbers is denoted by N. 
The whole numbers are the natural numbers together with 0. The integer set includes 
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Fig. 1.1 A positive length 
whose square is 2 


Fig. 1.2. Venn diagram for Cc 
well-known number sets 


whole numbers and negative whole numbers. Integers can be positive, negative, or 
zero. The set of integers is denoted by Z. A rational number is any number that can 
be expressed as the quotient or fraction a/b of two integers, a numerator a and a 
non-zero denominator b. The set of all rational numbers is denoted by Q. Now, con- 
sider a square with sides having length one, see Fig. 1.1. If d represents the length of 
the diagonal, then from the geometry we know that 17 + 1* = d?, ie., d” = 2. This 
means that there is a positive length whose square is 2, which we write as /2. But /2 
cannot be a rational number. The fact that ./2 is not rational is proved indirectly, by 
showing that the assumption that \/2 is rational leads to a contradiction. It is evident 
that there are a lot of rational numbers, yet there are “gaps” in Q. Numbers that are 
not rational are called irrational numbers. 

The set of all irrational numbers is denoted by I. These are the numbers that 
we normally use and apply in real-world applications. The set of all real numbers 
is denoted by R. A complex number is formed by adding a real number to a real 
multiple of i, where i = ./—1. Thus, we could write C = {a + bi | a, b € R}, the set 
of complex numbers. Clearly, we have N C ZC QC RC CandI CR. Pictorially 
we can illustrate these inclusions by a Venn diagram (see Fig. 1.2). 

It is quite possible that elements of a set may themselves be sets. For example, 
the set of all subsets of a given set A has sets as its elements. This set is called the 
power set of A and is denoted by P(A). When dealing with sets whose elements 
are themselves sets, it is fairly a common practice to refer them as family of sets; 
however, this is not a definition. Let J be a set and assume that with each element 
i € J there is associated a set A;. The family of all such sets A; is called an indexed 
family of sets indexed by the set J and is denoted by {A; | i € 7}. We can extend the 
notion of union and intersection to arbitrary family of sets. 
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Definition 1.4 Let F be a family of sets. 
(1) The union in F denoted by |) A is the set of all elements that are in A for 


AcF 
some A € F. Thatis _) A= {x € U|x € A for some A € F}. 
ACF 
(2) The intersection in F denoted by () A is the set of all elements that are in A 
AcF 
for all A € F. Thatis () A= {x € U|x €A forall A € F}. 
ACF 


Let X and Y be sets, not necessary distinct, and let R be a relation from X to Y. 
Then, the inverse R~! of the relation R is the relation from Y to X such that R~! = 
{(y, x) | (x, y) € R}. The domain of R denoted by Dom(R) is the set Dom(R) = 
{x € X | (x, y) € R for some y € Y}. The image of R, denoted by Im(R), is the set 
Im(R) = {y € Y | (x, y) € R for some x € X}. 


Definition 1.5 A relation R on a set X is called an equivalence relation if it is 
reflexive, symmetric, and transitive, i.e., for allx, y,z € X 


(1) xRx (reflexive), 
(2) xRy implies yRx (symmetric), 
(3) xRy and yRz imply x Rz (transitive). 


Example 1.6 (1) Let X = Rand define the square relation R = {(x, y)| x? = y7}. 
The square relation is an equivalence relation. 

(2) Let R be the relation on Z x Z such that (x, y)R(z, t) if and only if x +t= 
y +z. The relation R is an equivalence relation. 


Example 1.7 (1) Let P denote the set of people. Let R be the relation on P such 
that for all x, y € P is defined as x Ry if and only if x is the sister of y. Then R 
is not an equivalence relation. 

(2) The relation “is less than or equal to”, denoted “<”, is not an equivalence relation 
on the set of real numbers. For any x, y, z € R, “<” is reflexive and transitive 
but not necessary symmetric. 


Given a non-empty set X, there always exist at least two equivalence relation on 
X. One of these relations is R = X x X and another one is the diagonal relation Ax 
(also called the identity relation) defined by Ay = {(x, x) | x € X} which relates 
every element with itself. Pictorially, if X is represented as a line interval, then X x X 
is a square and A is the main diagonal of the square (see Fig. 1.3). The relation Ay 
is the smallest equivalence relation on X, whereas X x X is the largest. 


Fig. 1.3. Diagonal relation 
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Fig. 1.4 Equivalence classes 
of the relation R soe | 


If R is an equivalence relation on X, we define the equivalence class x € X to 
be the set [x] = {y € X | xRy}. We write X/R to show the set of all equivalence 
classes. The procedural version of this definition is 


VyeX, ye [x] © xRy. 


Example 1.8 Let X = {a, b, c, d, e, f, g} and define the relation R on X as 
follows: 


R= {(a,a), (6,6), (c,c), 4), (,e), (Af), (8,8), 4), Go), 
(c,a), (bd), (d,b), (@, f), (f:2), (8), (8,2), (8), (8, AI. 


Then, R is an equivalence relation. We can easily draw a picture illustrating how 
the equivalence classes divided X into three pieces. In Fig. 1.4, we see that X/R = 


{{a,c}, {bd}, fe, fai}. 


Theorem 1.9 Let R be an equivalence relation on a set X. Then the following are 
equivalent. 


(1) xRy (the elements are related); 
(2) [x] = Ly] (the classes intersect non-trivially); 
(3) [x] A Ly] # @ (the classes are equal). 


Proof (1 = 2): Assume that x Ry. Let c € [x] be an arbitrary element. Then, we 
have xRc. Since xRy and R is symmetric, it follows that yRx. Moreover, since 
yRx and x Rc, it follows that yRc. This yields that c € [y]. So, we conclude that 
[x] C [y]. Similarly, we can show that [y] C [x]. Consequently, we have [x] = [y]. 

(2 = 3): Since x € [x], it follows that [x] is non-empty, so the result is obvious. 

(3 = 1): Suppose that [x] Ny] 4%. Then, there exists c € [x] N[y]. This 
implies that x Rc and yRc. By the symmetry and transitivity properties, we obtain 
xRy. | 


Corollary 1.10 Jf X is a set, R is an equivalence relation on a set X, and x and y 
are elements of X, then either [x] Ny] = @ or [x] = [y]. 


A partition of a set X is a finite or infinite family of non-empty, mutually disjoint 
subsets whose union is X. 
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Fig. 1.5 A partition of the 
set X 
a 


Definition 1.11 A partition of a set X is a family of disjoint non-empty subsets of 
X that have X as their union. In other words, the family of subsets A;, i € J (where 
T is an index set) forms a partition of X if and only if 


(1) A; 4 @, for eachi € J; 
(2) Aj NA; =@ wheni 4 j; 
3) UA =X. 
iel 
Example 1.12 Figure 1.5 shows that the family A;, Az, A3, A4, and As of subsets 
of X forms a partition of X. 


Theorem 1.13 Let R be an equivalence relation on a non-empty set X. Then, the 
family of equivalence classes, X/R, forms a partition of X. 


Proof Clearly, X/R = {[x] | x € X} is a family of non-empty subsets of X. Next, 
if [x] 4 Ly], then by Corollary 1.10, we have [x] N[y] = @. Finally, we have to 


show that (J [x] = X. This is also trivial, because each x € X belongs to [x]. This 
xeX 
completes the proof. | 


Theorem 1.14 Given a partition {A; | i € I} of the subsets of X, then there is an 
equivalence relation R that has the sets A; (i € I) as its equivalence classes. 


Proof We define the relation R on X by x Ry if and only if there exists i € J such 
that x, y € A;. Since each x € X is contained in A;, for some i € J, it follows that 
xRx, i.e., R is reflexive. The symmetry of R is a result of the definition. In order 
to prove the transitivity, assume that x, y, and z are three arbitrary elements of X 
satisfying xRy and yRz. Then, by the above definition, there exist i, 7 € J such 
that x, y € A; and y, z € A;. This implies that y € A; MN A;. So, we conclude that 
A; = Aj. Consequently, x, z € A;, and hence x Rz. Therefore, R is an equivalence 
relation on X. 

Now, let x € X be an arbitrary element. Then, there is one and only onei € J such 
that x € A;. Consequently, we have [x] = A;. So, we showed that each equivalence 
class in X/R is a set in the family {A; | i € 7}. Conversely, suppose that A; is any 
set in the partition. Since A; is non-empty, it follows that there is x € X such that 
x € A;. This yields that A; = [x]. | 
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Exercises 


1. Prove that the De Morgan rules for {A; | i € J},er, a family of subsets of U, 


namely 
@ (UA) = 48 b) (MAi) =U As. 
iel iel iel iel 
2. Let A, B, and C be sets. Prove that 
(a) A\ (BUC) =(A\ B)N(A\C); (b) A\ (BNC) =(A\ B)U(A\C). 
3. Let A= {1, 2, 3, ..., 10}. Prove that if we select any 20 ordered pairs from 


A x A, then we can always find two of the chosen pairs that give the same sum 
when the numbers within the pair are added together. 

4. How many relations are there on a set with n elements? 

5. Let R be a relation from X to Y and let A be any subset of X. Define R(A), 
called R-image of A by R(A) = {y € Y | aRy, for some a ¢€ A}. If A and B 
are subsets of X, prove that 


(a) R(AU B) = R(A)U R(B); (b) R(AN B) € R(A)N R(B). 


6. Let X be the set of all lines. For any x, y € X define x Ry if and only if x is 
perpendicular to y. Show that R is symmetric but neither reflexive nor transitive. 
7. Let X = {a, b, c}. 


(a) Find all relations on X; 
(b) Find all equivalence relations on X; 
(c) Find all those relations on X which are reflexive but neither symmetric nor 


transitive; 
(d) Find all those relations on X which are symmetric but neither reflexive nor 
transitive; 
(e) Find all those relations on X which are transitive but neither reflexive nor 
symmetric; 
(f) Find all those relations on X which are neither reflexive, nor symmetric, nor 
transitive. 
8. For each real numbers a and b, define a Rb if a — b is an integer. Show that R is 
an equivalence relation on the set of real numbers. 
9. For each integers a and b, define aRb if ab > 0. Is R an equivalence relation on 


Z? 

10. Let X = R x R. Define a relation R on X by (x, y)R(z, t) if and only if the 
distance from (x, y) to (z, f) is an irrational number. Determine whether or not 
R is an equivalence relation. 

. Show that given any family of equivalence relations on a set X, their intersection 
is an equivalence relation on X. 


— 
— 
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12. Let X = {1, 2, ..., 10}. For every k € X, define 
Ax, = {B € P(X) | k is the least element of B}. 


Let P = {{H}, Ai, ..., Ato}. 


(a) Prove that P is a partition of P(X); 
(b) Let R be the equivalence relation on P(X) corresponding to P. How many 
equivalence classes does R have? 


1.2 Functions 


Let X and Y be two sets. A function f from X into Y is a subset of X x Y (and 
hence is a relation from X into Y) with the property that each x € X belongs to 
precisely one pair (x, y). In other words no two distinct ordered pairs have the same 
first coordinate. Instead of (x, y) € f we usually write y = f(x). Then y is called 
the image of x under f . The set X is called the domain of f. The range of f is 
the set {y € Y | y= f(x) for some x € X}. That is, the range of f is the subset 
of Y consisting of all images of elements of X. If f is a function from X into 
Y, we write f : X — Y. A constant function is a function whose range consists 
of a single element. We can define some operations on functions. New functions 
are formed from given functions by adding, subtracting, multiplying, and dividing 
function values. These functions are known as the sum, difference, product, and 
quotient of the original functions. More concisely, let f and g be two functions and 
D the largest set on which f and g are both defined. Then, we define 


(f + g)(x) = f(x) + g(x), (fe)(x) = f(x) + g(x), 
(O. . 
(f — 9)@) = fx) — 8), (LO = Fay 8) #0 


for all x €¢ D. We suppose that the readers are familiar to the elementary functions, 
which include powers of x, roots of x, trigonometric functions, logarithms and expo- 
nential functions, and hyperbolic functions. 


Definition 1.15 Let f : X — Y andg: Y —> Z be two functions. Then, we define 
anew function g o f, the composition of f and g by go f : X — Z whose value at 
x is given by (g 0 f)(x) = g( de (x)), for all x € X. In another notation, we have 


go f ={(x,z) € X x Z| Ay € Y such that (x, y) € f and (y, z) € g}. 


Definition 1.16 A function f : X — Y is said to be 


(1) injective or one to one if f (x1) = f (x2) implies x) = x2, 
(2) surjective or onto provided that y € Y, then there exists at least x € X such that 


y= f(x), 
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Fig. 1.6 Image of A f 
under f 
xX Y 
Fig. 1.7 Inverse image of B f 
under f 
xX Y 


(3) bijective if it is one to one and onto. A bijective is also called a one to one 
correspondence. 


Definition 1.17 Two sets are said to have the same cardinality if there exists a one 
to one correspondence between them. An infinite set is said to be countably infinite 
(enumerable) if it has the same cardinality as the set of positive integers. An infinite 
set is uncountably infinite (non-denumerable) if it is infinite but not countably infinite. 


Definition 1.18 Let f : X — Y beafunction and let A C X and B C Y. The image 
of A under f, which we denote by f(A), is 


F(A) = {f@) | x € A}. 
The inverse image of B under f, which we denote by f~!(B), is 
f-'(B) = {x € X | fx) € B}. 


The reader may picture this as illustrated in Figs. 1.6 and 1.7. 


Lemma 1.19 Let f : X — Y be a function. 


(1) AC BCX, then f(A) ¢ f(B); 
(2) fC © DCY, then f-(C) © f(D). 


Proof The proof is left as an exercise. a 


Theorem 1.20 Let f : X — Y be a function and let {A; | i € I} be a family of 
subsets of X. Then 
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Q) #(U 4) =U sos 2) £(M4) SA FAD. 
iel iel iel iel 
Proof The proof is left as an exercise. a 


The inclusion symbol in Theorem 1.20 (2) may not be replaced by an equal sign, 
as the next example shows. 


Example 1.21 Suppose that X = {a, b, c}, Y = {d, e}, J = {1, 2}, A; = {a, b}, 
Az = {b, c},andlet f : X — Yisdefinedby f(a) = f(c) = dand f(b) = e. Then, 
we have f(A; M Az) = f({b}) = {e}, whereas f(A1) N f(A2) = {d, e}. This yields 
that f(A1M Aa) = f({b}) # (Ai) 0 f (Ad). 


Theorem 1.22 Let f : X — Y be a function and let {B; | i € I} be a family of 
subsets of Y. Then 


Q) #"'(U Bi) =U Sp: @ F(NB)=A Fp. 

iel iel iel ie! 
Proof The proof is left as an exercise. a 
Exercises 


1. Let f : X > Y be a function, and let B C Y. Prove that f-'(Y \ B) = X \ 
f7(B). 

2. Let f : X — Y bea function, and let A and B be subsets of X. Give an example 
which shows that it is not true that f(A \ B) = f(A) \ f(B). 

3. Let f : X — Y bea function, and let A C X and B C Y. Prove that 


(a) AC f-'(f(A)); (b) f(f-'(B)) CB. 


4. Let f : X — Y bea function, and let A C X and B C Y. Give examples which 
show that the following statements are not true. 


(a) f-'(f(A)) = 4; (b) f(f7'(B)) = B. 
5. Let f : X — Y bea function, and let A C X and B C Y. Prove that 
(a) f(AN f7'(B)) = f(A) OB; (b) f(f-'(B)) = f(O NB. 


6. Prove that there is a one to one correspondence between the set N of natural 
numbers and the set of all even natural numbers. 

7. Prove that there is a one to one correspondence between the set Z of integers and 
the set of all odd integers. 
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1.3. Ordered Sets 


In this section, we look at certain kinds of ordered sets and then we introduce the 
notion of Hasse diagram. 


Definition 1.23 Let R be a relation on a set X. We say that R is antisymmetric if 
for all x, y € X,xRy and yRx imply x = y. 


Definition 1.24 A relation R on a set X is called a partial order relation if it is 
reflexive, antisymmetric, and transitive. 


Example 1.25 Two fundamental partial order relations are the relation “<” on a set 
of real numbers and the relation “C” on a family of sets. 


Because of special role played by the relation < in the study of partial order 
relations, the symbol ~ is often used to refer to a partial order relation. 


Example 1.26 Givenn € N and let X be the family of partitions of {1, 2, ..., nb}. 
For A, B € X define A ~ B if and only if each class of A is contained in a class of 
B. Then, (X, X) is a partially ordered set. 


Suppose that ~< is a partial order relation on a set X. Elements x and y of X 
are said to be compatible if either x < y or y X x. Otherwise, x and y are called 
non-compatible. 


Definition 1.27 A total order relation < ona set X is a partial order relation such 
that all elements of X are compatible. 


Definition 1.28 A partially ordered set is a pair (X, <), where X is a set and < is 
a partial order relation on X. 

A totally ordered set is a pair (X, <), where X is a set and ~< is a total order 
relation on X. 


Definition 1.29 Let X be a set that is partially ordered with respect to a relation <. 
A subset C of X is called a chain if the elements in each pair of elements in C are 
compatible. 


The length of a chain is one less than the number of elements in the chain. If C is 
a chain in X, then C is a totally ordered set with respect to the restriction of < to C. 
A chain C in a partial order set is maximal if no more elements can be added on it. 

If (X, <) is a partially ordered set, then x < y means x < y and x # y. Given 
x,y € X, we say that y cover x if x < y and there does not exist z € X such that 
X~Z~/y. 


Definition 1.30 The Hasse diagram of a partially ordered set (X, <) is the graph 
with vertices x in X and 


(1) Ifx ~ y, then y is drawn above x in the diagram; 
(2) If y covers x, then there is an edge between x and y in the diagram. 


Example 1.31 Consider Example 1.26 and suppose that X is the family of all par- 
titions of {1, 2, 3}. Let 
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Fig. 1.8 Hasse diagram for As 
the family of all partitions of 
{1, 2, 3} in Example 1.31 ae | ie 
Ag A3 Ag 
At 
Fig. 1.9 Hasse diagram for {a,b,c} 


(P({a, b, c}, S) 


{a, b} {a,c} {b,c} 
{a} {b} {c} 
) 
A; = {{1, 2, 3}}, Ag = {{3}, {1, 2}}, 
A,= {1}, {2, 3} ’ 
A3 = {2}, {1, 3h], Ay= {{1}; (2), 433}. 


Then, X = {A;, Ao, A3, Ag, As}, and we have As ~ Aj, Ao, A3, Ag and 
Az, A3, A4 < A;. The Hasse diagram for this partially ordered set is given in 
Fig. 1.8. 


Example 1.32 Let A = {a, b, c} and X = P(A). By Example 1.25, we know that 
(X, C) is a partial ordered set. The Hasse diagram for this partially ordered set is 
given in Fig. 1.9. 


Definition 1.33 Let (X, <) be a partially ordered set and A C X. 


(1) Anelement u € X is said to be an upper bound for A if a = u, for alla € A; 

(2) Anupper bound up for A is the least upper bound (or supremum) for A if up X u, 
for each upper bound u of A; 

(3) Anelementm € X is said to be maximal if for allx € X,m < x impliesm = x. 


Definition 1.33 has a dual form as follows: 
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Definition 1.34 Let (X, <) be a partially ordered set and A C X. 


(1) Anelement/ € X is said to be a lower bound for A if 1 < a, for alla € A; 

(2) A lower bound Ip for A is the greatest lower bound (or infimum) for A if 1 < lp, 
for each lower bound / of A; 

(3) Anelementm’ € X is said to be minimal if for allx € X,x < m' impliesm' = x. 


Example 1.35 Let A be a non-empty set and B be a subset of the partially ordered 


set (P(A), C). Then the least upper bound of B is ) B, and the greatest lower 
BeB 
bound of Bis (| B. 
BeB 


Zorn’s lemma, also known as Kuratowski-Zorn lemma originally called maximum 
principle, statement in the language of set theory is often used to prove the existence 
of a mathematical object when it cannot be explicitly produced. 


Theorem 1.36 (Zorn’s Lemma) Let (X, <) be a partially ordered set in which every 
totally ordered subset has an upper bound. Then, X has a maximal element. 


Exercises 


1. Let F be the set of all functions f : R > R and let 
R={(f.g)¢€ F x F| f(x) < g(x), forall x € R}. 


Prove that (F, R) is a partially ordered set. 

2. Let A = {a, b, c, d} and consider the partially ordered set (P(A), C). Find all 
upper bounds, all lower bounds, the least upper bound, and the greatest lower 
bound for the set {{a, b}, {a, b, d}}. 

3. Let X = {dao0, G10, 411, 412, 413, 420, 421, A22} be a partial ordered set with 
Hasse diagram shown in Fig. 1.10. Let A = {ao9, a9, @12}. Determine 


(1) The upper and lower bounds of A; 
(2) The least upper bound and greatest lower bound of A if they exist; 
(3) The maximal and minimal elements of X. 


4. Let (X, <) be a partially ordered set and A C X. If the least upper bound (or 
greatest lower bound) of A exists, prove that it is unique. 

5. Let (X1, <1) and (X2, <2) be two partially ordered sets. The lexicographical 
order < on X, x Xz is given by specifying that 


(a,b) X (c,d) &@ axcor(a=candb <a). 
Prove that (X; x X2, <) is a partially ordered set. 


6. If (X, <) is a partially ordered set and x, y are two non-compatible elements of 
X, prove that there is a partially order x’ extending < such that x <’ y. 
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Fig. 1.10 Hasse diagram for 
Exercise 3 


a20 21 a22 


a10 aii a12 a13 
00 


1.4 Combinatorial Analysis 


Many problems in group theory can be solved by simply counting the number of 
different ways that a certain choice can occur. The mathematical theory of counting 
is formally known as combinatorial analysis. The following principle of counting is 
basic. 


Theorem 1.37 (Basic Principle of Counting) Suppose that two experiments are to be 
performed. Then if the first experiment can result in any one of m possible outcomes 
and if for each outcome of the first experiment, there exist n possible outcomes of 
the second experiment, then together there exist mn possible outcomes of the two 
experiments. 


Proof The basic principle of counting may be proved by enumerating all the possible 
outcomes of the two experiments as follows: 


d,1), €,2), ..., d,n) 
(2,1), (2,2), ..., (2,n) 
ie a: (m,2), ... , (m,n) 


where we say that the outcome is (i, /) if the first experiment results in its ith possible 
outcome and the second experiment then results in the jth of its possible outcome. 
Therefore, the set of possible outcomes consists of m rows, each row containing n 
elements, which proves the result. a 


A generalization version of the basic principle states that if r experiments are such 
that the first may result in any of n; possible outcomes, and if for each of the possible 
outcomes of the first i — | experiments there aren; (i = 1, ..., 7) possible outcomes 
of the ith experiment, then there are a total of njn2...n, possible outcomes of the 
r experiments. 


Example 1.38 How many functions defined on n points are possible if each func- 
tional value is either 0 or 1? Assume that the points are 1, 2,...,. Since f(i) must 
be either 0 or 1 foreachi = 1, 2, ...,, it follows that there are 2” possible functions. 
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How many different ordered arrangements of the letters a, b, and c are possible? 
By direct enumeration we see that there are 6, namely 


abc, acb, bac, bca, cab, cba. 


Each arrangement is known as a permutation. So, a permutation of the set {a,, do, ..., 
d,} is an arrangement of these elements in some order without omissions or repetition. 
Permutations have important role in the group theory. We will study them deeply in 
Sects. 5.1 and 5.2. 

Sometimes we are interested in determining the number of different collections 
of r objects that could be formed from total of n objects. 


We define ("), torr <n by 


(1) = Gop 


where n! denotes the product n(n — 1)...3-2- 1 of the first n consecutive natural 
numbers if n > 0 and 0! = | by convention. 


Theorem 1.39 ) represents the number of possible combinations of n objects 


taken r at a time. 


Proof As n(n —1)...(n—r +1) represents the number of different ways that a 
collection of r objects could be selected from n objects when the order of selection 
is relevant, and as each collection of r objects will be counted r! times in this count, 
it follows that the number of different collection of r objects that could be formed 
from a set of n objects is 


naa Deere dy n! 


This completes the proof. a 


Thus, (") represents the number of different subsets of size r that could be 


selected from a set of n objects when the order of selection is not considered relevant. 
Theorem 1.40 (Binomial Theorem) /fx and y are two variables and n is a positive 
integer, then 
n 
n_ n k. n—k 
(x+y) =X (a) (1.1) 


Proof Consider the product (x; + y,)(x2 + y2)..-(%, + y,). Its expansion consists 
of the sum of 2” terms and each term is the product of n factors. Moreover, each of 
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the 2” terms in the sum contains as a factor either x; or y; for eachi = 1,2,...,n. 
Now, how many of the 2” terms in the sum have a factor k of the x;s and n — k of 
the y;s? Since each term consisting of k of the x;s and n — k of the y;s corresponds 


to a choice of a collection of k from the n values x1, x2, ..., Xn, it follows that there 
exist G) such terms. Finally, if we take x; = x and y, = y(@i = 1,..., n), then we 
obtain Eq. (1.1). | 


n 


Im +m +--+, =n, wedefine(, "4, ) by 


Nj,N2,...,Ny ni!ny!...n,! 


n 


Theorem 1.41 Jf nj +n. +---+n, =n, then ( ) represents the 


N1,N2,...,Nyr 
number of possible divisions of n distinct objects into r distinct collections of respec- 
tive sizes N,,N2,...,Nrp. 


Proof The proof is left as an exercise. a 


Theorem 1.42 (Multinomial Theorem) Jf x), x2,...,x, are variables and n is a 
positive integer, then 


n non n, 
Gi ty hoe) = y ( ) jg Sake 
Ny1,12,.-.+,Mr 
(nq...nr): 
nyte-tnp=n 


Proof The proof is left as an exercise. a 


Theorem 1.43 There are (”" i 7 ") distinct non-negative integer-valued 


sequences (X1,X2,...,X;) Satisfying 
xp tro te +x, =n. (1.2) 


Proof Consider a sequence of n ones and r — | zeros. To each permutation of this 
sequence we correspond a solution of Equation (1.2), namely the solution where 


x, = the number of ones to the left of the first zero, 
X2 = the number of ones between the first and second zeros, 


x3 = the number of ones between the second and third zeros, 


X, = the number of ones to the right of the last zero. 
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For instance, if m = 8 andr = 5, then the sequence (1, 1,0, 0, 1, 1, 1,0, 1, 1, 1) 
corresponds to the solution x; = 2, x2 = 0, x3 = 3 and x4 = 3. It is not difficult 
to see that this correspondence between permutations of a sequence of n ones and 
r — 1 zeros and solutions of Eq. (1.2) is a one to one correspondence. Now, the result 


r—)! . 
follows because there are a 7 permutations of n ones and r — | zeros. | 


Exercises 


1. Prove that 


n—1 syscets : 
2. Prove that there are (" . ) positive integer-valued solutions of 


Xp +X. +s++ +X, HN. 


n— 


3. Prove that there are exactly (,) ( pay 


) solutions of 


Xp txg +++: +X, HN. 
for which exactly k of the xs equal to 0. 


4. In how many ways can n indistinguishable balls be distributed into r urns in such 
a way that the ith urn contains at least m; balls. Assume that 


F 
n= ) m;. 
i=l 


Answer: ‘ 7 Limi aan '). 


n— imi 
5. Show that 


6. Suppose a street grid starts at position (0, 0) and extends up and to the right, see 
Fig. 1.11 
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Fig. 1.11 A street grid 


(0,0) 


A shortest route along streets from (0, 0) to (i, j) is i+ j blocks long, going i 
blocks east and j blocks north. How many such routes are there? Suppose that the 
block between (k, /) and (k + 1, /) is closed, where k < i and! < j. How many 
shortest routes are there from (0, 0) to (i, j)? 


1.5  Divisibility and Prime Numbers 


In this section, all numbers are integers, unless specified otherwise. 

We introduce basic concepts of elementary number theory such as divisibility and 
greatest common divisor. The principal results are Division algorithm, which pro- 
vides a quotient and a reminder when we divide two numbers and Bezout’s Lemma, 
which establish the existence of the greatest common divisor of any two integers as 
a linear combination of those two. 


Definition 1.44 We say d divides n and we write d|n whenever n = cd for some c. 
If d divides n, we also say that d is a divisor or factor of n. The symbol d { n means 
that d does not divide n. 


The Principle of Induction. If S is a subset of Z such that 


dd) leS, 
(2) ne Simpliesn+1 € S, 


then all integers greater than | belong to S. 
The basic assumption we make about the set of integers is the well-ordering 
principle. 


The Well-Ordering Principle. Any non-empty set of non-negative integers has 
a smallest number. 

The well-ordering principle is logically equivalent to the principle of induction. 
We assume that the reader is familiar with this principle and its use in proving 
theorems by induction. 
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Theorem 1.45 (Mathematical Induction) Let P(n) be a statement about the positive 
integers such that 


(1) P(A) is true; 
(2) If P(k) happens to be true for some integer k > 1, then P(k + 1) is also true. 


Then, P(n) is true for eachn > 1. 


Proof Suppose that the theorem is false. Then, by the well-ordering principle, there 
exists a smallest integer m > 1 for which P(m) is not true. Since P(1) is true, it 
follows that m 4 1. So, m > 1. Now, we have | < m — 1 < m, so by the choice of 
m, P(m — 1) must be true. But in this case, by the inductive hypothesis, we must 
have P(m) is true. This is a contradiction. Therefore, we conclude that there can be 
no integer for which P is not true, so the theorem is proved. | 


The following theorem is the foundation of much of what we will do during the 
book. 


Theorem 1.46 (Division Algorithm) [fa and b are integers with b > 0, then there 
exist unique integers q andr such that 


(1) a=bq+r, 
(2) O<r<b. 


Proof Leta and b be fixed integers with b > 0, and let S = {a — bx |x € Zandb — 
ax > 0}. We claim that S is non-empty. There exist two possibilities: 


(1) Ifa => 0, then a — bx is non-negative for x = 0. 

(2) If a < 0, then —a > 0. Since Db is a positive integer, it follows that b > 1. So, 
we get —ab > —a, or equivalently a — ab > 0. Hence, a — bx is non-negative 
forx =a. 


Therefore, we conclude that S is non-empty. Since S is a non-empty subset of N, 
by the well-ordering principle we know that S contains a smallest element, call it r. 
Since r € S, it follows that r is of the form r = a — bg for some q, or equivalently 
a= bq +r.Weclaim thatr < b. Assume that on the contrary r > b. Then,r — b > 
0. So, we have 

r—b=(a—bq)—b=a-—b(q+1) <r. (1.3) 


But r is the smallest element of S, while Eq. (1.3) seems to say there is another 
element of S smaller than r. This is a contradiction. Consequently, r < b. In order 
to complete the proof, we must show that the integers g and r are unique. To do 
this, also, assume that there exist integers g’ and r’ such thata = bq +r = bq’ +r’ 
and 0 <r,r’ < b. Without loss of generality, suppose that r’ > r. Then, we have 
0 = bq +r —bq' —r' = b(q —q')—- (r' —r), or equivalently b(q — q') =r’ —r. 
Since b > 0 and r'—r > 0, it follows that g — q' > 0. But, if g — q' > 0, then 
r'—r > b. This is a contradiction, because r’ — r < b. Hence, we conclude that 
q —q' = 0, or equivalently g = q’,sor =r’. | 
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We say that q is the quotient and r is the remainder obtained when b is divided 
into a. If d divides two integers a and b, then d is called a common divisor of a and 
b. So, 1 is acommon divisor of each pair of integers a and b. 


Definition 1.47 The positive integer d is said to be the greatest common divisor of 
a and b if 


(1) dla and d|b; 
(2) Any divisor of a and b is a divisor of d, i.e., c|a and c|b implies c|d. 


We shall use the notation (a, b) for greatest common divisor of a and b. We 
can extend the definition of greatest common divisor in a straightforward way. 
Given n integers a1, a2, ..., A, not all zero, we define their greatest common divisor 
(a,, G2, ..., G,) to be the greatest integer which divides all the given numbers. 


Theorem 1.48 (Bezout’s Lemma) /f both a and b are not zero, then (a, b) exists, is 
unique, and in addition, we can find integers x9 and yo such that (a, b) = axo + byo. 


Proof Suppose that S = {ax + by | x, y € Z}. Since one of the a or b is not zero, 
it follows that there exist non-zero integers in S. If ax + by is in S, it follows that 
a(—x) + b(—y) is also in S. Thus, S always contains some positive integers. Hence, 
by the well-ordering principle, there is a smallest positive number d € S. Since 
d € S, by the form of the elements of S we know that d = axo + byo, for some 
integers x9 and yo. We claim that d is our required greatest common divisor. First, 
note that if cla and c|b, then claxo + byo, or equivalently c|d. Now, we must show 
that dja and d|b. By the Division algorithm, a = qd +r, where 0 <r <d, i.e., 
a = q(axo + byo) + r. Therefore, we have r = a(1 — gxo) + b(—qyo), so r must 
be in S. Since r < d andr € S, so by the choice of d, we conclude that r cannot be 
positive. Hence, r = 0. Consequently, we have a = qd and so d|a. Analogously, we 
obtain d|b. 

For the uniqueness of d, if d’ > 0 also satisfied d’|a, d'|b and d|d’ for each d such 
that d|a and d|b, then we would have d'|d and d|d’. This yields that d = d’. | 


Theorem 1.49 (Euclidean Algorithm) Given positive integers a and b, where b { a. 
Let ro =a, r) = b and apply the division algorithm repeatedly to obtain a set of 
remainders r2,13,...,1n,'n+1 defined by 


ro=ngdt+hn, 0<nm <r, 
Yr) =12qQ2 + 1's, 0<17r3 <P, 
ln—2 =Tn-19n-1 +1 ns 0 <ry <Tn-1, 
Tn-1 = Tn ns Mh = 0. 


Then, rj, the last non-zero remainder in this process is the greatest common divisor 
of a and b. 
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Proof Inview of the last equality r,_) = rndn we have r,|r;—1. The next to last shows 
that r,|r,-2. By induction, we observe that r,|r;, for each integer 7. In particular, 
r, |r; = b andr,|ro = a. So, r, is acommon divisor of a and b. Now, suppose that c 
be any common divisor of a and b. Then, respectively, we obtain clr, c|r3,..., |n- 
Therefore, we conclude that r,, = (a, b). |_| 


Theorem 1.50 The greatest common divisor has the following properties: 


(1) (,b) = , a); 

(2) (a, (b,c)) = (a, d), ©); 

(3) (ac, bc) = |c\(a, b); 

(4) (a, 1) = (1, a) = 1 and (a, 0) = (0, a) = fal. 


Proof We prove only (3). The proofs of the other statements are straightforward. 
Suppose that d = (a, b) and e = (ac, bc). We wish to prove that e = |c|d. Since 
d = (a, b), it follows that d = ax + by, for some integers x and y. Then, cd = 
acx + bcy, which implies that e|cd. Moreover, since cd|ac and cd|bc, it follows 
that cd|e. Therefore, we conclude that |e| = |cd|, or equivalently e = |c|d. a 


The integers a and bare relatively prime if (a, b) = 1. More generally, the integers 
a1, 2,..., Ay are called relatively prime if (a, a2, ..., @,) = 1, and they are called 
pairwise relatively prime if any two of them are relatively prime. 


Theorem 1.51 Let a and b be two integers. Then, a and b are relatively prime if 
and only if there exist integers x and y such that 1 = ax + by. 


Proof If d = (a, b), then by Theorem 1.48, d = ax + by for some integers x and y. 
Conversely, let 1 = ax + by for some integers x and y. If d = (a, b), then dla 
and d|b. So, d|ax + by, or equivalently d|1. This implies that d = 1. a 


Theorem 1.52 (Euclid’s Lemma) If.a|bc and (a, b) = 1, thena|c. 


Proof Since (a,b) = 1, it follows that 1 = ax + by, for some integers x and y. 
Then, c = acx + bcy. Now, since alacx and a|bcy, it follows that alacx + bcy, or 
equivalently a|c. a 


Theorem 1.53 /f (a,b) = d, then (a/d, b/d) = 1. 


Proof Suppose that (a/d, b/d) = c. Then, c|(a/d) and c|(b/d). So, there exist inte- 
gers u and v such that a = dcu and b = dcv. Hence, dc is a common divisor of a 
and b. If c > 1, then this is a contradiction, because cd > d but d is the greatest 
common divisor of a and b. Therefore, we conclude that c = 1. |_| 


We recall the notions of prime and composite numbers. The principle result is 
Fundamental Theorem of Arithmetic, which shows that each integer greater than 1 
can be represented as a product of prime numbers in only one way apart from the 
order of factors. 
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Definition 1.54 An integer n > | is prime if the only positive divisors of n are 1 
and n. An integer n > | which is not prime is called composite. 


Prime numbers are usually denoted by p, p’, pi, 9,9’; di- 


Theorem 1.55 Let p be a prime number. If p\ab, then p|a or p\b. 


Proof Suppose that p|ab and p { a. Since p has only trivial divisors, it follows that 
(p, a) = 1. Now, by Euclid’s lemma, we conclude that p|b. a 


Theorem 1.55 is easily extended to: 


Corollary 1.56 Let p be a prime number. If p\ayaz...an, then p|a;, for some 1 < 
i<n. 


Proof The result follows by induction. | 


Theorem 1.57 Each integer n greater than | is either a prime number or a product 
of prime numbers. 


Proof The existence of such a factorization is proved by induction. Assume that 
each integer less than n can be written as a product of primes. If n is a prime, then 
we have a factorization of n consisting of one prime factor. If n is composite, then 
n =n4n2 with 1 < nj, n2 <n and it follows from the induction hypothesis that each 
of n; and np is a product of primes. Therefore, we conclude that n is a product of 
primes too. | 


There is a certain uniqueness about the composition of an integer into primes. We 
make this precise now. 


Theorem 1.58 (Fundamental Theorem of Arithmetic) Each integer n greater than 
1 can be expressed as a product of primes in a unique way apart from the order of 
the prime factors. 


Proof We use induction on n. The theorem is true for n = 2. Suppose that it is true 
for all integers greater than | and less than n. We shall prove it is also true for n. If 
n is prime, then there is nothing more to prove. Suppose that the theorem is false for 
n. So, n is composite with two different factorizations. Let 


n= Pip2r---Pk = 4192---; (1.4) 


where each p; and q; is prime and where the two factorizations are different. Since 
Pildiq2---q, by Corollary 1.56, it follows that p; divides one of the prime numbers 
41; 92,---,q1- Renumbering these numbers, we may assume that p;|qi, which of 
course means that p; = q. Dividing n by p; we obtain a smaller number 


n 
__ = P2P3--- Pk = 9293 ---1- 
Pi 
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Ifk > lor] > 1, then 1 < n/p, <n. By induction hypothesis, n/p, has the unique 
factorization property. So, the two factorization of n/p, must be identical apart from 
the order of the factors. Consequently, k =/ and the factorizations in (1.56) are 
also identical apart from the order. Therefore, we see that the primes arising for the 
factorization of n are unique. This contradicts the lack of such uniqueness for n. This 
completes the proof. | 


In the factorization of an integer n, a particular prime p may occur more than 
once. If the distinct prime factors of n are pj, p2,..., px and if p; occurs as a factor 
a; times, we can write n = pi'' pS’... py". 


Theorem 1.59 (Euclid’s Theorem) There is an infinite number of primes. 


Proof Suppose that there are only a finite number of primes, say pj, p2,..., Dk. 
Now, consider the integer N = 1+ pi po... pe. Since N > 1, it follows that either 
N is prime or N is a product of primes. Since N > p;, foreach | <i < k, it follows 
that N cannot be prime. Moreover, no p; divides N (if p;|N, then p; divides the 
difference N — pi p2... px = 1). Thus, N is not prime nor is divisible by any prime. 
This violates Theorem 1.57. Therefore, we deduce that the assumption that there are 
only finitely many primes is not true. a 


Theorem 1.60 fn > 1 is composite, then n has a prime divisor p < /n. 


Proof Letn = ab, where 1 < a, b < n. We claim that at least one of the a or b is less 
than or equal to ./n. For if not, thena > /nandb > ./n. Thus,n = ab > J/nn = 
n, which is impossible. Without loss of generality, we assume that a < ./n. Since 
a > 1, it follows that there exists a prime number p such that p|a. Since pla and 
a|n, it follows that p|n. Consequently, we have p < a < ./n, and this completes the 
proof. | 


Gauss introduced a remarkable notation which simplifies many problems con- 
cerning divisibility of integers. 


Definition 1.61 Let > 0 bea fixed integer and a, b € Z. We define a = b(mod n) 
if n|a — b. This relation is referred to as congruence modulo n, and n is called the 
modulus of the relation. 


Lemma 1.62 The relation congruence modulo n defines an equivalence relation on 
the set of integers. 


Proof For eacha € Z, since n|a — a, it follows that a = a(mod n), i.e., the relation 
= is reflexive. Further, for each a, b € Z, if a = b(mod n), then n|a — b and so 
n|b — a, which implies that b = a(mod n), i.e., the relation = is symmetric. Finally, 
for each a, b,c € Z, if a = b(mod n) and b = c(mod n), then nla — b and n|b — c. 
Consequently, n|a — b + b — corn|a — c and hence a = c(mod n), i.e., the relation 
= is transitive. | 


Theorem 1.63 [fa = b(mod n) and c = d(mod n), then we have 
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(1) ax +cy = bx + dy(mod n), for all integers x and y; 

(2) ac = bd(mod n); 

(3) a” = b" (mod n), for all positive integer m; 

(4) f(@ = f(b)(mod n), for each polynomial f with integer coefficients. 


Proof The proof is left as an exercise. a 


Theorem 1.63 tells us that two congruences with the same modulus can be added, 
subtracted, or multiplied, member by member, as though they were equations. 


Theorem 1.64 [fc > 0, then a = b(mod n) if and only if ac = bc(mod nc). 
Proof We have n|b — a if and only if cn|c(b — a). | 


Theorem 1.65 [fac = bc(mod n) and d = (n, c), then 
n 
= b(mod —). 
a = b(mod “) 


Proof Since ac = bc(mod n), it follows that n|c(a — b). This implies that (n/d) 
|\(c/d)(a — b). Since (n/d, c/d) = 1, it follows that n/d|a — b. | 


Theorem 1.66 a= b(mod n) if and only if a and b leave the same remainder when 
divided by n. 


Proof Suppose thata = b(mod n). Then, b = a + nq, for some integer q. If a leaves 
the remainder r when divided by n, then we have a = nq’ +r with O<r <n. 
Thus, we get b=a+ng =nq’+r+nq=n(q'+q)+r, so b leaves the same 
remainder when divided by n. 

The converse is straightforward and we omit the proof. a 


Theorem 1.67 Jf (a,n) = 1, then the congruence ax = b(mod n) has a solution 
x = c. In this case, the general solution of the congruence is given by x = c(mod n). 


Proof Since (a,n) = 1, it follows that ar + ns = 1, for some integers r and s. 
This implies that arb + nsb = b, or equivalently abr = b(mod n). This yields that 
c = br isa solution of the congruence ax = b(mod n). In general, if x = c(mod n), 
then ax =ac= b(mod n). Now, we claim that any solution of ax = b(mod n) is 
necessary congruent to c modulo n. We already know that ac = b(mod n). Then, 
ax —ac= 0(mod n). Consequently, n|a(x — c). Since (a, n) = 1, we conclude that 
n|x — c and so x = c(mod n). | 


An important special case occurs when n is a prime p. 


Corollary 1.68 If p is a prime number, then the congruence ax = b(mod p) has a 
unique solution x modulo p provided a € O(mod p) 
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Exercises 

1. If (a, b) = (a,c) = 1, prove that (a, bc) = 1. 

2. If (a, b) = 1 and dla + BD, prove that (a, d) = (b, d) = 1. 

3. Find a positive integer n such that 37|n, 47|n + 1 and 5?|n + 2. 

4. The Fibonacci sequence 1, 1, 2, 3, 5, 8, 13, 21, 34,... is defined by the 


recursion formula a); = dy + Gn—1 With a, = a2 = 1. Prove that (ay, dn41) = 
1, for each positive integer n. 
5. Let n be a composite positive integer and let p be the smallest prime divisor of 
n. Show that if p > ¥/n, then n/p is prime. 
6. If p is a prime, prove that one cannot find non-zero integers a and b such that 
a? = pb’. 
7. Let n be a composite positive integer and let p be the smallest prime divisor of 
n. Show that if p > ./n, then n/p is prime. 
8. If 2” — | is prime, prove that n is prime. 
9. If 2” + 1 is prime, prove that n is a power of 2. 
10. Prove that 5n? + 7n° = O(mod 12) for all integer n. 
11. If (m,n) = 1, given integers a and b, prove that there exists an integer x such 
that x = a(mod m) and x = b(mod n). 
12. Prove that each number of the set of n — | consecutive integers 


ni+2, n!+3,..., n!tn 


is divisible by a prime which does not divisible any other integer of the set. 


1.6 Worked-Out Problems 


Problem 1.69 Show that 


Solution We arrange 1+ 2+ .----+n red circles in a triangle, mirror it, and we 
observe that this fills all positions of a square of size (n + 1) x (n+ 1) except for 
the diagonal. Figure 1.12 shows this for n = 9. Consequently, we obtain 


2)°k= (41 -@t+)=nnt dD. 
k=1 


Now, dividing by 2 gives the result. | 
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Fig. 1.12 A picture for describing the proof 


Problem 1.70 Prove that 


Solution We double count strings over the alphabet {a, b, c} of length n. 

First way: There are three possibilities per character. Hence, there exist 3” possi- 
bilities in total. 

Second way: First, choose the number of times k that the letter a should be used 


(0 < k <n). Then, choose the position for those characters, there are ) possibil- 


ities. Finally, choose for each of the remaining n — k positions if they should be b or 
c, so there are 2”~* choices. Therefore, in total, there is 


n 
k(n 
u(y) 
k=0 
number of possibilities. By reversing the order of summation, we get 


ra(,")=L2(}). 


This completes the proof. | 


Problem 1.71 Ifa and D are integers, the least common multiple of a and b, written 
as [a, b], is defined as that positive integer d such that 

(1) ald and bld; 

(2) Ifa|c and bIc, then d|c. 


If a and b are two positive integers, prove that 


ab 
(a,b) 


[a,b]= 
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Solution Suppose that d = (a, b). If d = 1, then any common multiple of a and b 
must be a multiple of ab too. This means that [a, b] = ab. 
Ifd > 1, then (a/d, b/d) = 1. So, according to the above discussion, we have 


8-35 


Hence, we get d?[a/d,b/d|=ab, and so d[a,b]=ab, or equivalently 
(a, b)[a, b] = ab. This completes the proof. | 


Problem 1.72 Let a and b be two positive integers, where b { a. Show that the 
Euclidean algorithm for calculating (a, b) takes at most 2([log, b| + 1) steps (divi- 
sions). 


Solution Suppose that k = [log, b] + 1. In the notation of Euclidean algorithm, we 
must show that n < 2k. To do this, assume that n > 2k + 1. We claim that rj. < 
r;/2, for each 1 < j <n — 2. We consider the following two cases: 

Case 1: lf rj4, <7; /2, thenrjy2 <rjy) <7;/2. 

Case 2: If rj4, > 7;/2, thenr; = rj4igj41 +1rj+42, where gj41 = 1. So, we have 
rj42 <j —Vj+1 <rj/2. 

Consequently, we obtain 


ln — 
1<r, < 5 < a ae 5: Ee Gt 


and so k < log, b. This is a contradiction, because 
k = [log, b] + 1 > log, b. 


This completes the proof. | 


Problem 1.73 If p is a prime, prove that 


a a 
(2) = aq api = ened Po. 
foreach 1 <k < p—l. 


Solution We have 
p= (2 )@ ble, 


Clearly, p divides the left side, so p must divide one of the factors on the right side. 
But all factors of (p — k)! and k! are less than p, this yields that p { (p — k)! and 


p{k!. Therefore, we conclude that p| ( ). This completes the proof. | 
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Problem 1.74 (Fermat’s Little Theorem) For any prime p and any integer a, prove 
that a? = a(mod p). Note that this statement is equivalent to: If p is a prime and a 
is an integer with (a, p) = 1, then a?~'! = 1(mod p). 


Solution Ifa = 0, the result is clear. Leta > 0 and we apply mathematical induction 
on a. The result is true for a = 1. Suppose that the congruence is true for a, and we 
prove it for a + 1. By using Problem 1.73, we obtain 


P 


@+pP=> (Pat =a? +1=a+41(mod p). 
k=0 


This proves the congruence for positive integers. Since each integer is congruent to 
a positive integer modulo p, the result follows. a 


Problem 1.75 (Wilson’s Theorem) For each prime p, 
(p — 1)! = —1(mod p). 


Solution If p = 2, the result is clear. So, we suppose that p > 2. For each | < 
a < p—1, there exists a unique 1 < a’ < p —1 satisfying aa’ = —1(mod p). If 
a=1ora=p-— 1, thena = a’. The converse statement also is true, because a’ = 
a implies that (a — I)(a+1) = a* — 1 = 0(mod p), so that a = —1(mod p) or 
a = p — \(mod p). The remaining numbers, i.e., 1 < a < p — 1, can be considered 
in s = (p — 3)/2 pairs (a, a’), say (a, a}),..., (as, a). Therefore, we obtain (p — 
1)! =1 x (p—1) x (aia)... (sal) = p — 1 = —1(mod p), as desired. | 


1.7 Supplementary Exercises 


1. Let U bea set. For any two subsets of U we define A + B = (A \ B) U(B \ A) 
and A- B = ANB. Prove that 


(a) A+ B=B+A; (e) A+ (B+C)=(A+8B)+C; 
. eee (f) If A+B=A+C, then B=C; 
(d) A+A=9Q; (g) A-(B+C)=A-B=A-C. 


2. Let A, B, and C be three finite sets, prove that 
|AUBUC|=|A|+|B]+|C] —|AN B|—|ANC|—-|BNC|+|ANBNCl. 


More generally, for finite sets A,, Az,..., An, prove that 
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=SAI- So An Alt SO 1A; A; A; 
i=1 


l<i<j<n l<i<j<k<n 


+... t(D" ALN AN... Agl. 


Lay 
i=l 


io) 


. If0 <m <n, how many subsets are there that have exactly m elements? 
4. Given sets A = {a, b, c, d, e, f}and B = {a, b, c, d, e}. Compute |P(A) \ 
P(B)|. 
5. Let X and Y be two sets having m and n elements, respectively. How many 
relations of X into Y exist? 
6. Let X = Z x (Z \ {0}). Define a relation R by declaring that (a, b) R(c, d) if 
and only if ad = bc. Prove that the relation R is an equivalence relation. 
7. Let S be the set of all integers of the form 23”, m > 0, n > 0. Show that there 
is a one to one correspondence between S and N. 
8. If X is a finite set and f is a function of X onto itself, show that f must be one 
to one. 
9. If X is a finite set and f is a one to one function of X into itself, show that f 
must be onto. 
10. Let f : X — Y bea function, and let A C X and B C Y. Prove that 


(a) If f is one to one, then f~'(f(A)) = A; 
(b) If f is onto, then f(f—!(B)) = B. 


11. Prove that the set of all rational numbers is countable. 

12. Let (X, x) be a partially ordered set in which every totally ordered subset has a 
lower bound. Show that X has a minimal element. 

13. A partially ordered set (L, <) is called a lattice if every subset {x, y} with two 
elements has a (unique) least upper bound, denoted by x v y, and a (unique) 
greatest lower bound, denoted by x A y. Prove that a lattice in which every chain 
has an upper bound has a unique maximal element. 

14. Ifaset has 2n elements, show that it has more subsets with n elements than with 
any other number of elements. 


15. Show that , 
Ce ee ee) 


Hint: Make use of Theorem 1.43. 
16. Find integers n and r such that the following equation is true: 


(=) +2() 7 (3) = (7). 


17. Ifa =p"... pe andb == pf! ... pe*, where the p; are distinct prime numbers 
and where each a; > 0 and 6; > 0, prove that 


(a) (a,b) = p{'... pj’, where y; = min{a;, 6;}, for each i; 
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(b) [a, b] = pi ; “Des where 6; = max{a;, B;}, for each i. 


18. Prove that (a, b) = (a+ b, [a, b]). 
19. Show that n! + 1 and (n + 1)! + 1 are relatively prime for all positive integers. 
20. Find a rule for divisibility by 11. 
21. Find a simple characterization of all numbers which are sums of two squares. 
22. Find the least positive integer n such that n|2” — 2 but n { 3” — 3. 
23. Ifa; <a <... <a, are integers, show that 
qj — aj 
im j 


I<i<j<n 


is an integer. 


Chapter 2 M®) 
Symmetries of Shapes cies 


In this chapter, we are interested in the symmetric properties of plane figures. By a 
symmetry of a plane figure we mean a motion of the plane that moves the figure so 
that it falls back on itself. 


2.1 Symmetry 


One of the most important and beautiful themes unifying many areas of modern 
mathematics is the study of symmetry. Many of us have an intuitive idea of symmetry, 
and we often think about certain shapes or patterns as being more or less symmetric 
than others. In this chapter we sharpen the concept of “shape” into a precise definition 
of “symmetry”. 


Definition 2.1 A transformation of the plane is a function f : R? > R?. 


Transformation involves moving an object from its original position to a new 
position. The object in the new position is called the image. Each point in the object 
is mapped to another point in the image. 

A geometric shape or object is symmetric if it can be divided into two or more 
identical pieces that are arranged in an organized fashion. This means that an object 
is symmetric if there is a transformation that moves individual pieces of the object, 
but doesn’t change the overall shape. The type of symmetry is determined by the 
way the pieces are organized. 


Example 2.2 Symmetry occurs in nature in many ways; for example, the human 
form is symmetric, see Fig. 2.1. 


Example 2.3 The heart carved out is an example of symmetry, see Fig. 2.2. 
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Fig. 2.1 Symmetry of human form 


Fig. 2.2. The heart carved 
out 


Definition 2.4 The symmetry through a line £ is a transformation of the plane 
which sends point P into point Q such that £ is the midperpendicular to segment 
PQ. Such a transformation is also called the axial symmetry and CL is called the axis 
of the symmetry. If a figure turns into itself under the symmetry through line £, then 
L£ is called the axis of symmetry of this figure. 


The line of symmetry can be vertical, horizontal, or diagonal. There may be one 
or more lines of symmetry, see Fig. 2.3. 


2.1 Symmetry 


Isosceles triangle Rectangle 
one line of symmetry two lines of symmetry 
| 


| 

7 fl x 4 | 
' } 

Equilateral triangle Square 


three lines of symmetry four lines of symmetry 


Irregular quadrilateral Circle 
no line of symmetry infinite lines of symmetry 


Fig. 2.3 Line of symmetry 
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ABCDEFGHIJ 
K<-EaMEN:O: 2-0 Roch 
OV WARNE Z 


Fig. 2.4 Font Geneva 


Exercises 


e 


Prove that the inverse of a bijective transformation is a bijective transformation. 
2. The letters in Fig.2.4 are in the font Geneva. Some of them have one line of 
symmetry, some have two, some have none, and some have point symmetry. The 
latter are invariant under a half-turn. Which ones are in the first set? The second? 
The third? The fourth? 


2.2. Translations 


Translation is a term used in geometry to describe a function that moves an object a 
certain distance. The object is not altered in any other way. It is not rotated, reflected, 
or resized. In a translation, every point of the object must be moved in the same 
direction and for the same distance. 


Definition 2.5 A translation is an object from one location to another, without any 
change in size or orientation. 


A horizontal translation refers to an object from left to right or vice versa along 
the x-axis (the horizontal access). A vertical translation refers to an object up or 
down along the y-axis (the vertical access). In many cases, a translation will be both 
horizontal and vertical, resulting in a diagonal object across the coordinate plane, for 
example, see Fig. 2.5. The trivial translation is the translation through zero distance; 
all other translations are non-trivial. 


Definition 2.6 Let P and Q be two points in a plane. The translation from P to Q 
is transformation f : R? —> R? such that 
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Fig. 2.5 A translation 


Fig. 2.6 The quadrilateral 
PQB'B and AA'B’B are 
parallelograms 


() O= f(P); 

(2) If P = Q, then f is the identity; 

(3) If P 4 Q, let A be any point on P Q and let B be any point of PQ; let A’ = f(A) 
and B’ = f(B). Then quadrilaterals PQB’B and AA'B’B are parallelograms, 
see Fig. 2.6. 


When P 4 Q one can think of a translation as a slide in the direction of vector 
PQ. If A is any point and C is the line through A parallel to PQ, then f(A) is the 
point on £ whose distance from A in the direction of vector PQ is PQ. 


36 2 Symmetries of Shapes 


iv yx 


Fig. 2.7 Some shapes with rotational symmetries 


ve VW 
xX xX 


Fig. 2.8 Positively (+) and negatively (—) oriented angle X OY 


Exercises 


1. Prove that the composition of two translations is a translation. 
2. Prove that 


(a) A composition of translations commutes; 
(b) The inverse of a translation is a translation. 


2.3 Rotation Symmetries 


An equilateral triangle can be rotated by 120°, 240°, or 360° angles without really 
changing it. If you were to close your eyes, and a friend rotated the triangle by one 
of those angles, then after opening your eyes you would not notice that anything 
had changed. In contrast, if that friend rotated the triangle by 33° or 85°, you would 
notice that the bottom edge of the triangle is no longer perfectly horizontal. Many 
other shapes that are not regular polygons also have rotational symmetries. Each 
shape illustrated in Fig. 2.7, for example, has rotational symmetries. 

In order to define the rotation we need the notions of “oriented angle” and of its 
“signed measure’. 

The oriented angle X OY is an angle in which we distinguish the order of its sides 
OX, OY (see Fig. 2.8). If the transition from OX to OY is opposite to the direction 
of the clock’s hands, then we consider the angle as being “positively oriented”, or 
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Fig. 2.9 Example of a rotation 


simply a positive angle. If the transition is in the same direction as the clock’s, we 
consider the angle as being “negatively oriented”, or simply a negative angle. 


Definition 2.7 A rotation in R? is a circular movement of an object around a center 
of rotation. 


Example 2.8 An equilateral triangle can be rotated by 120°, 240°, or 360° angles 
without really changing it. If you were to close your eyes, and a friend rotated the 
triangle by one of those angles, then after opening your eyes you would not notice 
that anything had changed. 


Example 2.9 Figure 2.9 shows that the pre-image A is rotated 90° counterclockwise 
about the center point A to form the rotated image. 


Exercises 


1. Show that the composition of two rotations is a rotation. 

2. Find the image of the ellipse x7/4 + y?/9 = 1 under the 60° rotation about (0, 0). 

3. A rotation of about (—1, 0) is followed by a rotation of about (1, 0). The first 
rotation is applied again after that. Analyze the composite of these three rotations. 
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Fig. 2.10 Beautiful reflections in nature 


2.4 Mirror Reflection Symmetries 


Another type of symmetry that we can find in two-dimensional geometric shapes 
is mirror reflection symmetry. More specifically, we can draw a line through some 
shapes and reflect the shape through this line without changing its appearance. 


Definition 2.10 A reflection is defined by its axis or line of symmetry, i.e., the mirror 
line. Each point P(x, y) is mapped onto the point P’(x’, y’) which is the mirror image 
of (x, y) in the mirror line. This yields that P P’ is perpendicular to the mirror. A 
reflection preserves distances. 


Example 2.11 Many objects in nature appear the same on the left and right; for 
instance, see Figs. 2.10 and 2.11. The left half of a butterfly appears the same as the 
right half, and if we were to place a mirror down the center to reflect the left half, 
the resulting butterfly would look the same as the original, see Fig. 2.11. 


Example 2.12 In Fig. 2.12, we can observe some reflections in nature. 


A glide reflection is a composition of transformations. In a glide reflection, a 
translation is first performed on the figure, then it is reflected over a line, which is 
parallel to the direction of the previous translation. Reversing the order of combining 
gives the same result. Glide reflections with non-trivial translation have no fixed 
points. The composition of a reflection in a line and a translation in a perpendicular 
direction is a reflection in a parallel line. However, a glide reflection cannot be reduced 
like that. Thus the effect of a reflection on a line combined with a translation in one of 
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Fig. 2.11 Butterfly and reflection 


Fig. 2.12 Some reflections in nature 
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Fig. 2.13 Footprints fixed 
by a glide reflection is, «sai, ae, 


the directions of that line is a glide reflection, with a special case as just a reflection. 
Therefore, the only required information is the translation rule and a line to reflect 
over, the resulting orientation of the two figures is opposite. 


Example 2.13 In your mind, picture the footprints you leave when walking in the 
sand. Imagine a line £ positioned midway between your left and right footprints. 
In your mind, slide the entire pattern one-half step in a direction parallel to £ then 
reflect in line £. The image pattern exactly superimposes on the original pattern. 
This transformation is an example of a glide reflection with axis £ (see Fig. 2.13). 


Alternatively, we can think of a glide reflection with axis £ as a reflection in line 
£ followed by a translation parallel to £. 


Definition 2.14 A non-identity transformation f is an involution if and only if f? = 
id. 


Note that an involution f has the property that f = f—!. 


Theorem 2.15 A reflection is an involution. 


Proof Left as an exercise for the reader. a 
Exercises 

1. What conjectures can you make about a figure reflected in two lines? 

2. Prove that a non-identity translation is not a reflection. 

3. Show that the composition of translations and non-trivial rotation is a rotation. 
4. Point P is reflected in two parallel lines, £ and L’, to form P’ and P”. The distance 


from £ to £L’ is 10 cm. What is the distance P P”. 
5. Words such as MOM and RADAR that spell the same forward and backward, 
are called palindromes. 


(a) When reflected in their vertical midlines, MOM remains MOM but the Rs 
and D in RADAR appear backward. Find at least five other words like MOM 
that are preserved under reflection in their vertical midlines. 

(b) When reflected in their horizontal midlines, MOM becomes WOW, but BOB 
remains BOB. Find at least five other words like BOB that are preserved under 
reflection in their horizontal midlines. 
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Fig. 2.14 S(F) is closed under composition of functions 


6. Show that any glide reflection can be written as the composition R o S of a reflec- 
tion R and a rotation S. Comment on the uniqueness of this decomposition. 

7. Show that any glide reflection can be written as the product of reflections in the 
sides of an equilateral triangle. 

8. Find all values for a and b such that f(x, y) = (ay, x/b) is an involution. 


2.5 Congruence Transformations 


We say that two plane figures are congruent if they have the same shape and size. 
In other words, two plane figures are congruent if one figure can be moved so that 
it fits exactly on top of the other figure. This movement can always be affected by 
a sequence of translations, rotations, and reflections. Each part of one figure can be 
matched with a part of the other figure, and matching angles have the same size, 
matching intervals have the same length, and matching regions have the same area. 

For instance, our reflections in a mirror have the same shape and size as we do, 
so we would say that we are congruent to our reflection in a mirror. 


Definition 2.16 A congruence transformation is a transformation under which the 
image and pre-image are congruent. 


We denote the set of all symmetries of a plane figure F by S(F’). The elements 
of S(F) are distance-preserving functions f : R? + R? such that f(F) = F. So 
we can form the composite of any two elements f and g in S(F) to obtain the 
function go f : R* > R’. Let f, g € S(F). Since f and g both map F to itself, 
so must go f; and since f and g both preserve distance, so must go f. Hence 
go f € S(F). We describe this situation by saying that the set S(F) is closed under 
composition of functions, see Fig. 2.14. 


Example 2.17 We consider some examples of composition in S(C), the set of sym- 
metries of the square. Any non-trivial translation alters the location of the square in 
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Fig. 2.15 Rotations and reflections of a square 


Fig. 2.16 Initial position 


the plane and so cannot be a symmetry of the square. Therefore, we consider only 
rotations and reflections as potential symmetries of the square. Figure2.15 shows 
our labeling for the following elements: 


Ro = Rotation of 0° So = Reflection about a horizontal axis 
R, = Rotation of 90° S| = Reflection about a vertical axis 


R> = Rotation of 180° So = Reflection about the main diagonal 
R3 = Rotation of 270° S3 = Reflection about the other diagonal 


We want to find Rj o Sp and S$; o R,. We consider the initial position as Fig. 2.16 
to keep track of the composition of the symmetries. 

Figure 2.17 shows the effect of Rj o So, i.e., first Sp and then R;. Comparing the 
initial and final positions, we observe that the effect of R; o So is to reflect the square 
in the diagonal from bottom left to top right. This is the symmetry that we have called 
Sy, and hence R; o So = So. 


Exercises 


1. With the notation given in Example 2.17, find the following composites of sym- 
metries of the square: R2 o $3, Rz o Sz, and R303. 
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Ry fe} So So 


Fig. 2.17 R, o So = So 


2.6 Worked-Out Problems 


Problem 2.18 Let ABC be a triangle with the vertices labeled clockwise such that 
AC = BC and ZACB = 7/2. Let S4z be the reflection in the line AB, S,4c be the 
reflection in the line AC, and R be the rotation by 2/2 counterclockwise around B. 
Identify the composition R o S4z 0 Sac. 


Solution We can solve problems like this one using the following simple strategy. 
Find three points which form a triangle and see where the composition of isometries 
takes them. Next, it’s time to guess what the isometry is. If your guess is correct for the 
three vertices of the triangle, then it must be correct. And this is because the theorem 
above guarantees that if you know what an isometry does to three corners of a triangle, 
then you know what the isometry does to every point in the plane. Figure 2.18 shows 
triangle ABC drawn on a grid of squares. Since we want to choose three points which 
form a triangle, we may as well choose the points A, B, and C. 


e Itis easy to check that S4c(A) = A, S4g(A) = A and R(A) = P. In other words, 
Ro SAB 0 Sac(A) => P; 

e Itis easy to check that S4c(B) = Q,S4g(Q) = N and R(N) = Q. Inother words, 
Ro Sago Sac(B) = Q; 

e Itis easy to check that Sac(C) = C, Sap(C) = M and R(M) = C. Inother words, 
Ro SAB ° Sac(C) =C. 
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Fig. 2.18 Triangle ABC 
drawn on a grid of squares 


Hence, can you think of an isometry which takes A to P, B to Q, and C to C? If 
you think hard enough, you should realize that it’s just a rotation by z around C. So, 


we have managed to deduce that the composition R o Sag o Sc is a rotation by 7 
around C. a 


Problem 2.19 Let ABC D be a rectangle with the vertices labeled counterclockwise 
such that BC = 2AB. Suppose that 


Sap 1s the reflection in the line AB; 

Rz is the counterclockwise rotation by 2/2 about B; 

Tpp is the translation which takes D to B; 

Gcp is the glide reflection in the line CD which takes C to D. 


Identify the composition S43 o Rg o Tpgp 0 Gcp. 


Solution Figure2.19 shows rectangle ABC D drawn on a grid of squares. Since we 
want to choose three points which form a triangle, we may as well choose the points 
A, B, and C. 


e Itiseasy tocheck that Gcp(A) = E,Tpg(E) = D, Rg(D) = FandS,,(F) = J. 
In other words, S4z 0 Rg o Tpgp 0 Gcp(A) = J; 

e Itis easy to check that Gcp(B) = H,Tpg(A) = C, Rg(C) = J, and S4g(U/) = I. 
In other words, S4zp 0 Rp 0 Tpg o Gcp(B) = 1; 

e It is easy to check that Gcp(C) = D, Tpg(D) = B, Ra(B) = B, and S4g(B) = 
B. In other words, S4g 0 Rg o Tpgp 0 Gcp(C) = B. 


So can you think of an isometry which takes A to J, B to J, and C to B? If you think 
hard enough, you should realize that it is a rotation, although you might not be sure of 
where the center lies. However, we can use the fact that if a rotation takes X to Y, then 
the center of rotation must lie on the perpendicular bisector of xy. In particular, the 
center of the rotation that we are interested in must lie on the perpendicular bisector 
of AJ as well as the perpendicular bisector of BJ. And there is only one point which 
does that namely, the point O labeled in Fig. 2.19. It is now easy to deduce that the 
composition must be a rotation about O by ZAOJ = 2/2 in the clockwise direction. 

a 
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Fig. 2.19 Rectangle ABC D drawn on a grid of squares 


Fig. 2.20 What letter has 
been folded once to make 
this shape? 


Fig. 2.21 A ray of light is 
reflected by two 
perpendicular flat mirrors 


2.7 Supplementary Exercises 


1. Prove that the composition of two reflections with parallel axes is a translation 
perpendicular to these axes by a distance twice that from the first axis to the 
second. 

2. Prove that the composition of two reflections with axes meeting at a point is a 
rotation about that point through an angle twice that from the first axis to the 
second. 
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Prove that the composition of three reflections is a reflection if three axes are 
parallel or concurrent, and otherwise is a glide reflection. 

Five reflections are composed, with axes in order the lines x = 0, x + y = 6, 
y=6,y=x+2,y =x + 8.Is the composition a reflection or a glide reflection? 
Give details. 

What capital letters could be cut out of paper and given a single fold to produce 
Fig. 2.20? 

Give an example of a bijection f : R* — R? that preserves angles but not dis- 
tances. Describe in general terms the effect of f on lines, circles, and triangles. 
Prove that the composition of a non-trivial rotation and a reflection is glide reflec- 
tion except when the axis of the reflection passes through the center of the rotation, 
in which case it is a reflection. 

Aray of light is reflected by two perpendicular flat mirrors. Prove that the emerging 
ray is parallel to the initial incoming ray, as indicated in Fig. 2.21. 


Chapter 3 Mm) 
Groups ona 


The theory of groups is the oldest branch of modern algebra. The concept of a group 
is surely one of the central ideas of mathematics. A group is a set in which you can 
perform one operation with some nice properties. In this chapter we introduce the 
basic concepts of group theory. 


3.1 A Short History of Group Theory 


The concept of a group is one of the most fundamental in modern mathematics. 
Group theory can be considered the study of symmetry: the collection of symmetries 
of some object preserving some of its structure forms a group; in some sense all 
groups arise this way. Although permutations had been studied earlier, the theory of 
groups really began with Galois (1811-1832) who demonstrated that polynomials 
are best understood by examining certain groups of permutations of their roots. Since 
that time, groups have arisen in almost every branch of mathematics. There are three 
historical roots of group theory: 


(1) The theory of algebraic equations; 
(2) Number theory; 
(3) Geometry. 


Euler, Gauss, Lagrange, Abel, and Galois were early researchers in the field of 
group theory. Galois is honored as the first mathematician linking group theory and 
field theory, with the theory that is now called Galois theory. 

Permutations were first studied by Lagrange (1770, 1771) on the theory of alge- 
braic equations. Lagrange’s main object was to find out why cubic equations could 
be solved algebraically. In studying the cubic, for example, Lagrange assumes the 
roots of a given cubic equation are x’, x”, and x’”. Then, taking 1, w, and w” as the 
cube roots of unity, he examines the expression 
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R= x! +4 wx" ae wrx! 


and notes that it takes just two different values under the six permutations of the 
roots x’, x”, and x””. But, he could not fully develop this insight because he viewed 
permutations only as rearrangements, and not as bijections that can be composed. 
The composition of permutations does appear in the works of Ruffini and Abbati 
about 1800; in 1815 Cauchy established the calculus of permutations. 

Galois found that if 71,72, ..., 7, are the n roots of an equation, there is always 
a group of permutations of the rs such that every function of the roots invariable by 
the substitutions of the group is rationally known, and conversely, every rationally 
determinable function of the roots is invariant under the substitutions of the group. 
Galois also contributed to the theory of modular equations and to that of elliptic 
functions. His first publication on the group theory was made at the age of 18 (1829), 
but his contributions attracted little attention until the publication of his collected 
papers in 1846. 

The number-theoretic strand was started by Euler and taken up by Gauss, who 
developed modular arithmetic and considered additive and multiplicative groups 
related to quadratic fields. Indeed, in 1761, Euler studied modular arithmetic. In 
particular, he examined the remainders of powers of a number modulo n. Although 
Euler’s work is, of course, not stated in group-theoretic terms, he does provide an 
example of the decomposition of an abelian group into cosets of a subgroup. He also 
proves a special case of the order of a subgroup is being a divisor of the order of the 
group. 

Gauss in 1801 was to take Euler’s work much further and gives a considerable 
amount of work on modular arithmetic which amounts to a fair amount of theory 
of abelian groups. He examines orders of elements and proves (although not in this 
notation) that there is a subgroup for every number dividing the order of a cyclic 
group. Gauss also examined other abelian groups. He looked at binary quadratic 
forms 

ax? + 2bxy + cy’, 


where a, b, and c are integers. Gauss examined the behavior of forms under trans- 
formations and substitutions. He partitions forms into classes and then defines a 
composition on the classes. Gauss proves that the order of composition of three 
forms is immaterial, so in modern language, the associative law holds. In fact, Gauss 
has a finite abelian group, and later (in 1869) Schering, who edited Gauss’s works, 
found a basis for this abelian group. 

Geometry has been studied for a very long time, so it is reasonable to ask what 
happened to geometry at the beginning of the nineteenth century that contributed to 
the rise of the group concept. Geometry had begun to lose its metric character with 
projective and non-Euclidean geometries being studied. Also the movement to study 
geometry in m dimensions led to an abstraction in geometry itself. The difference 
between metric and incidence geometry comes from the work of Monge, his student 
Carnot, and perhaps, most importantly, the work of Poncelet. Non-Euclidean geom- 
etry was studied by Lambert, Gauss, Lobachevsky, and, Janos Bolyai, among others. 
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MObius in 1827, although he was completely unaware of the group concept, began 
to classify geometries using the fact that a particular geometry studies properties 
invariant under a particular group. Steiner in 1832 studied notions of synthetic geom- 
etry which were to eventually become part of the study of transformation groups. 

Arthur Cayley and Augustin Louis Cauchy were among the first to appreciate the 
importance of the theory, and to the latter especially are due to a number of important 
theorems. The subject was popularized by Serret, Camille Jordan, and Eugen Netto. 
Other group theorists of the nineteenth century were Bertrand, Charles Hermite, 
Frobenius, Leopold Kronecker, and Emile Mathieu. 

It was Walther von Dyck who, in 1882, gave the modern definition of a group. 

The study of what are now called Lie groups, and their discrete subgroups, as 
transformation groups, started systematically in 1884 by Sophus Lie; followed by 
the works of Killing, Study, Schur, Maurer, and Cartan. The discontinuous (discrete 
group) theory was built up by Felix Klein, Lie, Poincaré, and Charles Emile Picard, 
in connection in particular with modular forms. 

Other important mathematicians in this subject area include Emil Artin, Emmy 
Noether, Ludwig Sylow, and many others (Fig. 3.1). 


3.2 Binary Operations 


We are used to addition and multiplication of real numbers. These operations combine 
two real numbers to generate a unique single real number. So, we can look at these 
operations as functions on the set R x R defined by 


+:RxR-R 
(a,b) > a+b 


and 
-:-RxR-R 


(a,b) >a-b 


These operations are examples of binary operations. The general definition of a 
binary operation is as follows: 


Definition 3.1 Let S be a non-empty set. A binary operation on S is a function 
x: S$ x S — S that maps each ordered pair (x, y) of S to an element x(x, y) of S. 
For convenience we write a x b instead of x(a, b). The set S is said to be closed under 
the operation x. The pair (S, x) (or just S, if there is no fear of confusion) is called 
a groupoid. 


Binary operations are usually denoted by special symbols +, —, -, x, *, *, 0, ®, 
©, V, A, U, M rather than by letters. 


ce 


Example 3.2 The ordinary operations of addition “+”, subtraction , and mul- 
tiplication “.” are binary operations on Z, Q, and R. Subtraction is not a binary 
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(a) L. Euler (b) J. L. Lagrange (c) A. F. Mobius 
(1736-1813) (1790-1868) 
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(g) A. Cayley (h) L. Sylow (i) C. Jordan 
(1821-1895) (1832-1918) (1838-1922) 
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(j) F. Klein (1849—- (k) E. Noether (1) E. Artin 
1925) (1882-1935) (1898-1962) 


Fig. 3.1 Pictures of some famous mathematicians 
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operation on N, because 3 — 7 is not in N. Division is not a binary operation on Q, 
however, division is a binary operation on Q \ {0}. 


Example 3.3 Exponential operation a * b = a? is a binary operation on the set N 


of natural numbers while it is not a binary operation on the set Z of integers. 


Binary operators can be defined on arbitrary sets, not only sets of numbers. 


Example 3.4 We might consider a set S of colors, and define a binary operation @ 
which tells us how to combine two colors to form another color. If Red, Blue, Green, 
Yellow, and Purple are elements of S, then we can write Red @ Blue = Purple, since 
we can combine the first two colors to make the third. 


Example 3.5 If X is aset, then union “U” and intersection “NM” are binary operations 
on P(X). 


Example 3.6 Let P be the set of all propositions. Then, “and” and “or” are binary 
operations on P. In mathematical logic “and” is usually represented by A, and “or” 
is represented by v. 


Example 3.7 We noted earlier that binary operations can act not only on numbers 
but also on arbitrary elements, such as colors or other sets. Here we consider the set 
S = {Ro, Ri, Ro, R3, So, Si, S2, $3}as defined in Example 2.17, aset of geometric 
transformations, which we can combine. We can consider the binary operation of 
combining these geometric transformations by doing one and then doing the other. 


Definition 3.8 One way of describing a binary operation « on a set S (provided S is 
not too big) is to form a grid with rows and columns labeled by the elements of S, 
and enter the element a « b in the cell in row a and column b (for all a, b € S’). This 
is called a multiplication table or a Cayley table. 


Let * be be a binary operation on a finite set S = {aj, do, ..., d,} having n 
elements. We construct a Cayley table for « as follows: 


* aj a2 oe an 

Q,| A, * a, A, * a2... Ai xan 
az} Ag * A, dy * a2... Ad* ay 
An|An * A, Ay, * a2... A, * Ay, 


Let « be a binary operation on a set S. There are a number of interesting properties 
that a binary operation may or may not have. Specifying a list of properties that a 
binary operation must satisfy will allow us to define deep mathematical objects such 
as groups. 


e «is commutative ifaxb=bxa,foralla,be S. 
e x is associative if a * (b*c) = (ax b)«c, for alla,b,cé S. 
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Fig. 3.2 Cayley table of a 
commutative binary 
operation 


Let S be finite. The property thata « b = b x a, foralla, b € S,means that the Cayley 
table must be symmetric across the main diagonal, see Fig. 3.2. 


Example 3.9 (1) Let S=R and axb =a, for all a,b € S. Then 3*5 =3 and 
5x3 = 5, and so x is not commutative. However, if a, b,c € S, then a * (b*c) = 
a=axb=(axb) xc, and hence x is associative. 

(2) Let S = Q \ {0} andaxb=a/b, for alla,bé S. Then l*3=1/3 43= 
3/1, and so * is not commutative. Also, we have 


1 & (2% 3) = 1% (2/3) = 3/2 £ 1/6 = (1/2) «3 = (1 *2) 3. 


Hence x is not associative. 


Exercises 


1. Is (Z, x) a groupoid if * is defined, for each x, y € Z, by 


(a) xxy = JE Fy: 
(b) xxy=O+y)5 
(Cc) xx y=x-y—xy; 
(d) xx y=0. 

2. Define « on Qasaxb=ab +1, foralla,b € Q. Is * associative (prove or find 
a counterexample)? 

3. Prove that if * is an associative and commutative binary operation on a set S, then 
(a * b) « (c xd) = ((d «c) xa) «b, for alla,b,c,d €S. 

4. Let S be a finite set containing n elements. Find the total number of binary 
operations on S. 
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5. Let S be a finite set containing n elements. Show that the total number of com- 
mutative binary operation on S$ is n”"—)/?, 


3.3. Semigroups and Monoids (Optional) 


A semigroup is an algebraic structure consisting of a non-empty set together with 
an associative binary operation. The formal study of semigroups began in the early 
twentieth century. Semigroups are important in many areas of mathematics. We give 
here some basic definitions and very basic results concerning semigroups. 


Definition 3.10 A semigroup is a pair (S, *) in which S is a non-empty set and « is 
a binary associative operation on S. 


Semigroups are therefore one of the most basic types of algebraic structure. 
For an element x € S we let x” be the product of x with itself n times. So, xl=x, 


x? = xxx, andx"t! = x"*%x forn > 1. 
Let x1, X2,..., X, be a sequence of elements of a semigroup (S, *). We define 
Pi=*1, 


Pk = Pr-1* Xx, fork > 1. 


Now an arbitrary product of n elements of S is determined by an expression involv- 
ing n elements of S together with equal numbers of left and right parentheses that 
determine the order in which the product is evaluated. The general associative law 
ensures that the value of such a product is determined only by the order in which 
the elements of the semigroup occur within that product. Thus a product of n ele- 
ments of S has the value x, * x. *...* X,, Where x;, X2,...,X, are the elements to 
be multiplied, listed in the order in which they occur in the expression defining the 
product. 


Example 3.11 Given four elements x,, x2, x3, and x4 of a semigroup (S, «), the 
products 


(C1 * X2) &X3) *X4, (XL K XD) m (XZ XG), (HL & (XQ * 13) * X4, 
X1 * ((X2 *X3) *X4), Xp w (X2 & (3 * X4)), 


all have the same value. Note that according to the above definition, py = ((x; * 
X2) *X3) * X4. 


Theorem 3.12 (General Associative Law) Let (S, *) be a semigroup and x, X2,..., 
Xn € S. Every way of inserting balanced pairs of brackets into the product x * x2 * 
...*Xn give the same result. 


Proof We prove that any insertion of brackets into the product gives the same result 
aS X1 * (Xz & (X3 *... * X,)...). We proceed by mathematical induction on n. For 
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n = 1, the result is trivially true, for there is only one way to insert balanced pairs of 
brackets into the product x;. This is the base case of the induction. 

So, assume that the result holds for all n < k; we aim to show it is true for k. Take 
some bracketing of the product x,, x2, ..., x, and let y be the result. This bracketing 
is a product of some bracketing of x;,..., x; and some bracketing of xj+1,..., Xx 
for some 1 < j < k. Now, we consider the following two cases: 

Case 1: Suppose that j = 1. By the assumption, the result of inserting brackets 
into Xp kK... K XE = XQ... * Xz 18 Equal to Xz * (3 * (...X,)...). Thus, y = x1 * 
(X2 * (x3 * (...X%)...)), Which is the result with n = k. 

Case 2: Suppose that j > 1. By the assumption, the result of the bracketing of 
Xp we... XK; ISX] & (X72 * (...x;)...) and the result of the bracketing of xj4) *...* 
Xz AS Xj41 * (Xj42* (...xX%)...). Consequently, we obtain 


y= (x1 (x2 *(...%j)-.)) (x41 & (a jg5e Cc Ke) =) 
Xi* (((x2 (Lj). )) me pur ® (jy * GC. XE) -. ))) (by associativity) 
= X1 & (X2 * (X3...X4)...) (by assumption n = k — 1), 


which is the result with n = k. 
Therefore, by induction, the result holds for all n. |_| 


A semigroup S is finite if it has only a finitely many elements. A semigroup S is 
commutative, if it satisfies x * y = y x x, forall x, y € S. If there exists e in S such 
that for all x € S, 

exxX=xXxee=xX 


we Say that S is a semigroup with identity or (more usual) a monoid. The element e 
of S is called identity. 


Proposition 3.13 Let e be a left identity of S and e' is a right identity of S, then 
e = e’. Consequently, a semigroup contains at most one identity. 


Proof Since e¢ is a left identity, it follows that e x e’ = e’. Since e’ is a right identity, 
it follows that e = e x e’. Hence, we get e = exe’ = e’, as desired. |_| 


By Proposition 3.13, the identity element is unique and we shall generally denote 
it by 1. 


Example 3.14 Let N° = N U {0} be the set of all non-negative integers. Then, (N°, -) 
is a semigroup for the usual multiplication of integers. Also, (N°, +) is a semigroup, 
when + is the ordinary addition of integers. Define (N°, «) by n xm = max{n, m}. 
Then, (N°, «) is a semigroup, since 


nx(mxk) = max{n, max{m, k}} = max{n, m, k} 
= max{max{n, m}, k} = (n*xm) «xk. 


Example 3.15 Let S be anon-empty set. There are two simple semigroup structures 
on S$: with the multiplication given by x * y = x, for all x, y € S, in this case, the 
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Fig. 3.3. The rectangular 
band 


semigroup (S, *) is called the left zero semigroup over S. Also, fixing an element 
a € § and putting x * y = a, for all x, y € S, gives a semigroup structure on S. 


Example 3.16 The opposite semigroup S°? of S is the semigroup with the same set 
as S but “reversed multiplication”. That is, for x, y € S, the product x xp, y in S°? 
is equal to the product y « x in S. It is easy to check that S°? is indeed a semigroup. 
Notice that if S is commutative, then S°? and S are the same semigroup. 


An element x in a semigroup S is said to be idempotent if x” = x. 


Example 3.17 Let J, J be two non-empty sets and set T = J x J with the binary 
operation 
Gpj)xrkD=GD. 


Then, we have 


(a, Dx*k, 1)) x (m,n) = (i, 1) * (m,n) = (i, n), 
(i, j) * ((k, 1) * (m, n)) = (i, 1) * (k,n) = (i, n), 


for all (i, 7), (k,/) (m,n) € T, and so the multiplication is associative. Hence, 
(T,*) is a semigroup called the rectangular band on I x J. Note that (i, 7)? = 
(i, j) * (i, 7) = Gi, J), ie., every element is an idempotent. The name derives from 
the observation that if the members of J x J are pictured in a rectangular grid in the 
obvious fashion, then the product of two elements lies at the intersection of the row 
of the first member and the column of the second, see Fig. 3.3. 


Example 3.18 The direct product S x T of two semigroups (S,-) and (T, 0) is 
defined by 


(81, 41) * (82, f2) = (81 + 52, Of) (81,52 € S, 4, € T). 
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Itis easy to show that the so defined product is associative and hence the direct product 
is, indeed, a semigroup. The direct product is a convenient way of combining two 
semigroup operations. The new semigroup S x T inherits properties of both S and 
T. 


Example 3.19 On the Cartesian product Z x Z we define a binary operation as 
follows: 
(a—b+c,d)ifb<c 
(a, b) x (c,d) = 4 (a, d) ifb=c 
(a,d+b-c ifb>c, 


for all a, b, c,d € Z. The set Z x Z with such defined operation is a semigroup. 


Example 3.20 Let (S, «) be a semigroup, which is not a monoid. Find a symbol | 
such that 1 ¢ S. Now, we extend the multiplication on S to S U {1} by 


axb=axb ifa,beS, 
axl=a=I1xaforallaec 5S, 
l*el=1. 


Then, * is associative. Thus, we have managed to extend multiplication in S to 
SU {1}. For an arbitrary semigroup S the monoid S! is defined by 


sia S if S is a monoid, 
~ ) SU {1} if Sis not a monoid. 


Therefore, S! is “S with a 1 adjoined” if necessary. 


In a semigroup (S, *) an element z; such that z; * x = z, for every x € S is called 
a left zero element of S, and an element z, such that x * z, = z, for every x € S is 
called a right zero element of S. If z is both a left and a right zero element of S, then 
z 1s called a two-sided zero element, or simply a zero element, of S. A semigroup 
S may have any number of left (or of right) zero elements, but if it has a left zero 
element z; and a right zero element z,, then z;) = z; * Zz; = z,, whence S has a unique 
two-sided zero element and no other left or right zero element. Usually, we denote 
zero element by 0. 


Example 3.21 Let (S,*) be a semigroup. Similar to Example 3.20, let 0 be an 
element not in S and extend the multiplication on S to S U {0} by 


Oxx =xx0=0%0=0, 


for all x € S. Again, this extended multiplication is associative. Thus, S U {0} is a 
semigroup with zero element. We define 


99 = S if S has a zero, 
~ | §U {0} otherwise. 
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Then, S° is called the semigroup obtained by adjoining a zero to S if necessary. 


Theorem 3.22 [f (S, x) is a finite semigroup, then there is an idempotent element in 
S. 


Proof Since S is finite, it follows that a, a’, a°*,... cannot be distinct elements. So, 

there exist integers i and j withi < j such that a’ = a/. This implies that a‘t* = a’, 

where k = j — i. Now, we have 
quitk 


=qi x qitk = a. 


Next, by induction, we observe that 


qritk = qr 


’ 


for all positive integer n. Also, we have 


qrit2k = qritk x ak = qt x ak = qrtk a at, 


qrit3k ee qrit2k x ak a qrit2k = a”, 


and so on. Therefore, we deduce that 


ni 


qritrk =a", 


for all positive integer r. In particular, we have at" = a", or a**' = a". Now, if 
we take m = ki, then a” is an idempotent element. a 


Let (S,*) be a semigroup. An element b of the semigroup S is called a right 
divisor of the element a of the same semigroup if there exists an element x € S such 
that x * b = a. An element b is called a left divisor of a if there exists an element 
y € S such that b* y = a. If b is a right divisor of a, we say that a is divisible on 
the right by b. If b is a left divisor of a, we say that a is divisible on the left by b. If 
the element b of S is a right divisor of the element a of the same semigroup, then the 
element x satisfying the equation x * b = a is called a left inverse of b with respect 
to a. The notion of right inverse is defined analogously. An element which is both a 
right inverse and a left inverse of b with respect to a is called a two-sided inverse, or 
shortly an inverse, of b with respect to a. 


Definition 3.23 Let M be a monoid and x € M. If there exists an element x’ such 
that x * x’ = 1, then x’ is a right inverse for x, and x is right invertible. Similarly, 
suppose that there exists an element x” such that x” * x = 1, then x” is a left inverse 
for x, and x is left invertible. If x is both left and right invertible, then x is invertible. 


Proposition 3.24 Let M be a monoid, and let x € M. Suppose that x is invertible 
and let x’ be a right inverse of x and x” be a left inverse of x. Then, x' = x". 
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Proof Since x’ and x” are right and left inverses of x, respectively, it follows that 
x *«x’ = 1andx” «x = 1. Thus, we have 


x= Lax = (KX aX) KX HK we (XX) HX" kK LHX", 


as desired. |_| 


Proposition 3.24 says that right and left inverses coincide when they both exist. 
The existence of one does not imply the existence of the other. Thus, if x is an 
invertible element of a monoid M, denote the unique right and left inverse of x by 

-1 
es 


Definition 3.25 A subsemigroup A of a semigroup S is a non-empty subset of S 
such that A* A C A,ie.,x, y € A implies x * y € A. A submonoid A of a monoid 
M is asubsemigroup A which contains the identity | of M. 


Theorem 3.26 The set of invertible elements of a monoid forms a submonoid. 


Proof Let I be the set of invertible elements of a monoid M. Since | € J, it follows 
that J is non-empty. Suppose that x and y are two arbitrary elements of J. Since x 
and y are invertible, it follows that 


1 1 1 


x(xlaex)ey=yley=l, 
Lae ed 


(yl exo!) & (xe y) = 7 
(x ey) «(yl eT!) = xk (ye YT!) aT 


Hence, x * y is invertible and so x x y € J. Consequently, J is a subsemigroup of M. 
In addition, 1 € J is an identity for J. Therefore, J is a submonoid of M. | 


Definition 3.27 A semigroup (S, «) is left cancellative, if 
cra=crba>a=b, 

and (S, *) is right cancellative, if 
axc=bxc>a=b. 


If (S, «) is both left and right cancellative, then it is cancellative. 


Exercises 


1. Let (Q, *) be a groupoid, where a*b =a+b-—ab, foralla,beQ. 


(a) Is the groupoid (Q, *) a semigroup? 
(b) Is there an identity element in (Q, x)? 
(c) Which elements of the groupoid have inverses? 


3.3 
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If Q is replaced by Z in Exercise 1, are the solutions the same? 

A set S has n elements, what is the number of commutative binary operation of 
S? 

Prove that if S is a semigroup and e € S is both a right zero and a right identity, 
then S is trivial. 

In the set of all continuous functions of two variables x and y, we define in the 
square 0 < x < a,0 < y <a, the following operation, which plays an important 
role in the theory of integrals. The result of this operation carried out for the 
functions K(x, y) and K2(x, y) is the function 


i: K(x, t)Ka(t, y)dt. 
0 


By using the simplest properties of integrals, show that we obtain a semigroup. 
We consider the set of functions of one variable which are absolutely integrable 
for 0 < x < oo. In many branches of mathematics, one considers the operation 
in this set, the result of which for f,(x) and fo(x) is the function 


Ai) fox — t)dt. 
) 


Show that this operation is associative and commutative. 

Prove that for any commutative monoid M, the set of idempotent elements of M 
forms a submonoid. 

Let S be a left cancellative semigroup. Suppose that e € S is an idempotent. Prove 
that e is a left identity. Deduce that a cancellative semigroup can contain at most 
one idempotent, which must be an identity. 

Give an example of a semigroup containing more than one idempotent but con- 
taining no identity. 


3.4 Groups and Examples 


In this section, we embark on the study of the algebraic object known as a group. We 
give a few basic definitions and then concentrate on examples of groups. 


Definition 3.28 Let G be a non-empty set and « be a binary operation on G. We say 
that (G, «) is a group if it satisfies the following conditions: 


(1) 
(2) 


(3) 


Associativity Law: a x (b xc) = (a * b) xc, for alla, b,c € G; 

Existence of Identity: There exists an element e € G called an identity such that 
axe=exa=a,forallaecG; 

Existence of Inverse: For each a € G, there exists an element b € G such that 
axb=bxa =e (we write this element b as a~! and call it the inverse of a in 
G). 
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Hence, a group G is a monoid G that satisfies the condition (3). In other words, 
a group G is a semigroup with an identity in which every element has an inverse. 


Remark 3.29 In many books, there is an extra condition in the definition of group, 
that is, “G is closed under x’. In other words, x * y € G, for all a,b € G. But 
according to our definition, this condition is a result of the definition of binary 
operation. 


Corollary 3.30 If G is a group, then 


(1) The identity element of G is unique; 
(2) Everya € G has a unique inverse in G. 


Proof Since any group is a semigroup and a monoid, the result follows from Propo- 
sitions 3.13 and 3.24. a 


Definition 3.31 The number of elements in a group G is called the order of G and 
is denoted by |G]. If |G| is finite, then G is said to be a finite group; otherwise G is 
an infinite group. 


Before starting to look into the nature of groups, we look at some examples. 


Example 3.32 (Additive Groups) The set of integers Z, the set of rational numbers 
Q, the set of real numbers R, and the set of complex numbers C are all groups under 
ordinary addition. 


Example 3.33 If E is the set of even integers, then E is a group under addition of 
integers. 


Example 3.34 The sets Q* = Q \ {0} and R* = R \ {0} are groups under ordinary 
multiplication. 


Example 3.35 Let G be the set of real numbers of the forma + b./2, wherea, b € Q 
and are not simultaneously zero. Then, G is a group under the usual multiplication 
of real numbers. 


Example 3.36 Let C* be the set of all non-zero complex numbers, i.e., 
C* = {a+ bi| a,b € Rand (a 40 orb £0)}. 

Recall that i? = —1. We define multiplication of complex numbers as follows: 
(a+ bi) - (c+ di) = (ac — bd) + (ad + be)i. 

This is a binary operation on C*. Note that both ac — bd and ad + bc cannot be 


zero. Suppose that a + bi,c + di ande + fi are arbitrary elements of C*. Then, we 
have 
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((a + bi) - (c +di)) - (e+ fi) 
((ac — bd) + (ad + bc)i) - (e + fi) 
= ie — bd)e — (be + ad) f) + ((bc +ad)e+ (ac — bd) f )i 


a(ce — df) — b(de +cf)) + (b(ce — df) + a(de+cf))i 
=(a+bi)- (eae (de +cf)) 
= (a + bi)- ((c+di)- (e+ fi)). 


Hence, the multiplication - is associative on C*. Clearly, 1 = 1+ 0i € C* is the 
identity element in C*. Now, we check the existence of inverse. Assume that a + bi € 
C*. Then not both a and b are zero, and so a? + b? 4 0. Consequently, we have 


a b 


. * 
a2 +b? a+b lt 


Moreover, we see that 


a b, : ; a b, 
(aap wp pi)Ot b= 040 Boe appl) =) 


Therefore, we proved that (C*, -) is a group and we call this group the multiplicative 
group of non-zero complex numbers. 


Example 3.37 The set K4 = {e, a, b, c} under the binary operation defined by the 
following Cayley table is a group. 


gg 829 OFS 
en Qarajyoa 


SFa nn Bfag 


Qa Fa a 
a Fa ala 


This group is known as Klein’s four group. 


Definition 3.38 A group G is said to be abelian (or commutative) ifaxb=bx«a, 
for alla,b eG. 


All the previous examples of groups are abelian. Now, we give some examples of 
non-abelian groups. 


Example 3.39 The group D3 is the symmetry group of an equilateral triangle with 
vertices on the unit circle, at angles 0, 27/3, and 47/3, that is, it is the set of all 
reflection, rotation, and combinations of these, that leave the shape and position of 
this triangle fixed. Figure 3.4 shows the effect of the sixth element of D3. Note that 
r,s and t can be equally well considered as a rotation of 9 = in R? about the axes 
r,s and t. Then, these six operations form a group. The Cayley table for this group 
is: 
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Identity 


— 


2 
Rotation of > counterclockwise about center 


Reflection across the line r 


Rotation of = clockwise about center 
| 
i 


Reflection across the line s 


Reflection across the line t 


>> > > > D> 


Fig. 3.4. The symmetry group of an equilateral triangle 


- |Ro Ry Ro So Sy So 
Ro|Ro Ry Ro So Sy $2 
R,|R, Ro Ro S; So So 
R2|R2 Ro Ry S2 So Si 
So|So Sz $1 Ro Ro Ry 
S| S1 So Sz Ry Ro Ro 
So So Sy So Ro Ri Ro 


Example 3.40 The group Dg is the symmetry group of a square with vertices on the 
unit circle, at angles 0, 2/2, 2, and 37/2. Figure 3.5 shows the eight symmetries of 
square. Let 

Dg = {Ro, Ri, Ro, R3, So, Si, S2, S3} 
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Identity 
—> 


Rotation of 5 counterclockwise about center 
> 


Rotation of 7 counterclockwise about center 


: 3 : 
Rotation of > counterclockwise about center 


Reflection acros a horizontal axis 


Reflection across a vertical axis 


27 8 8 8 8 


Reflection across the main diagonal 


Reflection across the other diagonal 
> 


ee 8 88 8 8 


Fig. 3.5 The symmetry group of a square 
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be the symmetry group of a square. The Cayley table for this group is: 


Ro Ry Ro R3 So Si S2 83 
Ro} Ro Ri Ro R3 So Sy Sy S3 
R1| Ry Rz R3 Ro S; Sz $3 So 
R2| Ro R3 Ro Ry Sz S3 So Si 
R3| R3 Ro Ry Ro S3 So S, Sz 
So} So S3 S2 S; Ro R3 Ro Ry 
S| Sy So S3 Sy Ry Ro R3 Ro 
Sz | Sp S, So S3 Ry Ry Ro R3 
S3| $3 So S; So R3 Ro Ri Ro 


Example 3.41 Let G be the set of all 2 x 2 matrices i where a,b,c, and d 


a 
cd 
are real numbers such that ad — bc # 0. For the binary operation in G we use the 
following multiplication: 


ab a b'| _ | aa'+be' ab’ +bd' 
cd cd’\~ | ca'+dc' cb'+dd' |" 
Clearly, the entries of this 2 x 2 matrix are real. In order to see that this matrix 
belongs to G we must show that 
(aa' + bc’)(cb' + dd’) — (ab' + bd')(ca' + dc’) £0. 
A short computation gives that 


(aa’ + bc')(cb! + da’) — (ab + bd')(ca' + dc’) = (ad — bc)(a'd' — b'c') £0, 


ab Led a’ b’ 

cd c’ d' 
are in G. It is not difficult to check that the associative law holds in G. The element 
1D) 


because both 


= ; ; belongs to G, and it acts as an identity element relative to the binary 
operation of G. 
Now, suppose that ‘i if be an arbitrary element of G. Then, we have ad — bc # 


0, and so the matrix 
d —b 


ad—bc ad—bc 
—c a 


ad—bc ad—bc 
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makes sense. Moreover, we obtain 


d —b = d—b 1 
jae lee (ee) a 7 > ee = ee ee 


Thus, the matrix 


d —b 
ad—bc ad—bc 
—c a 


ad—bc ad—bc 
belongs to G. An easy computation shows that 


d —b d —b 
ab| | ad—be ad—be | ~|19|_ | ad—be ad—be | .| 4? 
cd —e a 01 —e a cd|- 


ad—bc ad—bc ad—bc ad—bc 


This means that this element of G is the inverse of E Al Therefore, we conclude 
that G is a group. 

Example 3.42 Let G = {(a,b) | a,b € R and a > 0}. We define the following 
operation on G as follows: 


(a, b) x (c,d) = (ac, bc + d), 


for all (a, b), (c,d) € G. Obviously, if b 4 0 and c 4 0, then bc +d 4 0. So, x is 
a binary operation on G. Now, we verify associativity. Suppose that (a, b), (c, d), 
and (e, f) are arbitrary elements of G. Then, we have 


((a, b) * (c,d)) x (e, f) = (ac, bc + d) x (e, f) 
= (ace, bce + de+ f) 
= (a, b) x (ce,de+ f) 
= (a, b) x ((c, d) x (e, J): 


Moreover, since (a, b) x (1, 0) = (1, 0) « (a, b) = (a, b), it follows that (1, 0) is the 
identity element. Finally, since 


(a.b)*(2 7)= (5. 2) « (a,b) = (1,0), 
a a a a 


it follows that (1/a, —b/a) is the inverse of (a, b). Therefore, (G, «) is a group. But 


(2,3) * (1, 4) = (2, 7) and (1, 4) « (2, 3) = (2, 11). 


This shows that G is not abelian. 
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Example 3.43 Let G be the set of all functions T,, : R — R defined by 7,,,(x) = 
ax + b, for each x € R, where a,b € Randa £ 0, i.e., 


G ={T,,:R—> R|a,be€ Randa ¥ 0}. 


Consider the product of elements of G as the compositions of functions. Hence, if 
Ta,» and T,,¢ are elements in G, then 


(Ta,b ° Te) (x) = Ta,b(Te,a(x)) = aT,.a(x)b 
= a(cx + d)+b= (ac)x + (ad + b) 
= ac,ad+b(X), 


for all x € R. Therefore, we conclude that 
Tab ° Tod _ Tac,ad-+b- 


This yields that Tyc.aa+b € G, 1.e., G is closed under the composition of functions. 
For all elements 7, ,, T,,q and T., ¢ in G, we see that 


(Tun ° Te) ° Te, ¢ on Tac,ad+b ° Te, f 
= Lace,acf+ad+b 
= tab? Tee,cf-+d 
= 1g) 0 (fa fe) Te5) 


So, the associativity law holds. The element 7} 9 is the identity function and 7; 9 o 
Tab = Ta,» © T1,0 = T1,0. Finally, what is eee! We must find real numbers x 4 0 
and y such that 7,4 0 Ty,y = Ty,y 0° Ta,p = T1,0, or equivalently 


Tax,ay+b = Tax,bx-+y — T\,0- 


This implies that ax = 1 anday + b= bx + y = 0. Remember a 4 0, soa = 1/a 
and y = —b/a. Thus, T;,-1,-q-1p is the inverse of T,,,,. Consequently, (G, 0) is a 
group. Since T,,7 o 73,4 # T3,4 0 T\,2, it follows that G is not abelian. 


It is a little clumsy to keep writing the « for the product in G, and from now on 
we shall write the product a * b simply as ab, for alla, b € G. 


Theorem 3.44 /f G is a group, then 


(1) For everyae G, (a“!)"! =a; 
(2) Foralla,b € G, (ab)~! = ba. 


Proof (1) For any a € G, we have 


aa“! =e=a'a and (a!)"!a~'! =e =a'(a"!)"!. 
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Since the inverse of a~! is unique, it follows that (a~!)~! = a. 
(2) This item is proved by the equalities 


(ab)(b“!a~!) = a(bb™)a~! = aea~! = aa"! =e, 
(b-!a7!)(ab) = b“'(a~'a)b = b“'eb = b“'b =e, 


and the uniqueness of the inverse. | 


In part (2), notice the change in the order of the factors from ab to b~'a7!. If ina 
group the binary operation is written additively, then a + b (for a, b € G) is called 
the sum of a and b, and the identity element is denoted by 0. Also, the inverse of 
a € Gis denoted by —a. We write a — b fora + (—b). Abelian groups are frequently 
written additively. 

Note that a” = aa...a. If n = —m is a negative integer, then we define a” = 

—— 
n times 
(a~')”; also, we define a® = e. The formulas a”a" = a+" and (a”)" = a"" hold 
for any element a of G and any pair of integers m and n. One must be careful 
with this notation when dealing with a specific group whose binary operation is 
addition. In this case, the definitions and group properties expressed in multiplicative 
notation must be translated to additive notation. Table 3.1 shows the common notation 
and corresponding terminology for groups under multiplication and groups under 
addition. 


Corollary 3.45 Jn any group G both the cancellation laws hold, i.e., 


(1) ab = ac implies b = c; 
(2) ba =ca implies b= c. 


Proof It is enough to multiply a! on the left or right side. | 
Theorem 3.46 Any finite semigroup in which both cancellation laws hold is a group. 


Proof Suppose that G is a finite semigroup with n elements in which both cancel- 
lation laws hold and let 
G = (x1, x2, i svg An fe 


Suppose that a € G is arbitrary and fixed. Then, the elements ax), ax2, ..., Xp 


are distinct, because ax; = ax; (for some i and j) implies that x; = x; (the left 
cancellation law). Consequently, we have 


Table 3.1 Notation and corresponding terminology for groups under multiplication and addition 


Multiplicative group Additive group 
a-borab Multiplicative a+b Addition 
eorl Identity 0 Zero 

a! Multiplicative inverse of a na Multiple of a 


Power of a a—b _ Difference 
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G = {ax, aXx2, ..., Xp}. 
Hence, for each x; € G, there exists x; € G such that 
Xj = ax;. (3.1) 


In particular, there exists x, € G such that a = ax,. Then, ax; = (ax,)xj = a(XEXj). 
Now, by the left cancellation law, we conclude that x; = x,x;, foreach 1 <i <n. 
Similarly, by considering the elements x;a, x2a, ..., X,a and using the right 
cancellation law, we can find an element x; € G such that x; = x;x;, for each 1 < 
i <n. So, we get x; = x¢x; = xz. Therefore, x; is the identity element in G. As 
usual, we set x, = e. 
Taking x; = e in (3.1), then we can say that there exists x,, € G such that 


€ = AX. 
In a similar way, by considering the elements xa, x2a, ..., X,a, we can find an 
element a, € G such that 

e=Xx,a. 


Thus, we can write 
Xp = Xe = X;(aXm) = (X;@)Xm = Xm = Xm- 


Hence, x,,a@ = aXm = e, whichimplies that x7! = x,,. This shows that every element 
in G has its inverse in G. Therefore, G is a group. a 


Corollary 3.47 A finite semigroup G is a group if and only if G satisfies both can- 
cellation laws. 


Proof The proof follows immediately from Corollary 3.45 and Theorem 3.46. MH 


Theorem 3.48 Let G be a non-empty set together with a binary operation. Then, G 
is a group if and only if 


(1) a(bc) = (ab)c, for all a,b € G; 
(2) Given any a,b € G, the equations ax = b and ya = b have unique solutions in 
G. 


Proof Let G be a group. Since a(a~'b) = (aa~')b = eb = b, it follows that x = 
a~'b is a solution of the equation ax = b in G. Similarly, y = ba! is a solution of 
the equation ya = b in G. The uniqueness of the solution follows from the cancel- 
lation law. 

Suppose that G is a non-empty set together with a binary operation satisfying 
(1) and (2). In order to show that G is a group we need to show that G contains an 
identity and each element of G has an inverse. 

Let a € G be an arbitrary element. By (2), since ax = a has solution in G, it 
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follows that there is an element e € G such that ae = a. 

Now, given b € G, then by (2) there exists y € G such that b = ya. Then, 
be = (ya)e = y(ae) = ya = b, and so be = b, for each b € G. Similarly, since the 
equation ya = a has solution in G, it follows that there exists e’ € G such that 
e’a =a. Again, given b € G, by (2) there exists z € G such that b = az. Then, 
eb = e'(az) = (e'a)z = az = b, and so e’b = b, for each b € G. In particular, since 
be = b, foreach b € G, it follows that e’e = e’. Also, since e’b = b, for eachb € G, 
it follows that e’e = e. Consequently, e = e’ and we get be = b = eb, foreachb € G. 
This shows that e is the identity element of G. 

Next, we prove the existence of inverse. Let a € G. Then, by (2), there exist 
a’,a" € G such that aa’ = e and a”a = e. Hence, we obtain 


a’ =ed =(a"a)a' =a" (aad') =a"e=a", 


and we conclude that a’a = e = aa’. This yields that a~! = a. a 


Theorem 3.49 Let G be a non-empty set together with a binary operation. Then, G 
is a group if and only if 

(1) a(bc) = (ab)c, for alla, b € G; 

(2) There exists an element e € G such that ae = a, for alla € G; 

(3) For every a € G, there exists a’ € G such that aa' = e. 


Proof If G is a group, then by Definition 3.28, the statements (1), (2), and (3) hold. 

Now, assume that G is anon-empty set together with a binary operation satisfying 
(1), (2), and (3). Let a € G. Then, by (3), there is a’ € G such that aa’ = e. Again, 
by (3), there exists a” € G such that a’a” = e. Hence, we get 


a'a = (a'a)e = (a'a)(a'a") = a'(a(a'a")) = a'((aa')a") 
— a'(ea") _ (a’e)a" = aa!’ =e. 


Consequently, we have aa’ = e = a’a. In addition, we find that 
ea = (aa’')a = a(a')a = a(a’a) =ae=a, 


and so ea = ae = a. Therefore, we deduce that G is a group. a 
Theorem 3.50 Let G be a non-empty set together with a binary operation. Then, G 
is a group if and only if 

(1) a(bc) = (ab)c, for alla, b € G; 

(2) There exists an element e € G such that ea = a, for alla € G; 

(3) For every a € G, there exists a’ € G such that a’a = e. 

Proof The proof is similar to the proof of Theorem 3.49. a 


The following example shows that the conclusion of Theorems 3.49 and 3.50 fails, 
if a semigroup S contains a right identity e and each element a € G may possess a 
left inverse. 
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Example 3.51 Let S be a set having at least two elements. For any x, y € G, we 
define xy = x. It is easy to check that S together with this binary operation is a 
semigroup. Let b and c be two distinct elements of S. Then, foreach a € S, we have 
ab = ac, while b £ c. Hence, S does not satisfy cancellation law, and so S is not a 
group. Now, let e be any fixed element of S. We have 


(1) For each a € S, ae =a. This means that e is a right identity element; 
(2) For eacha € G, since ea = e, it follows that e is a left inverse of a with respect 
toe. 


Consequently, S has the right identity e and each element of S has a left inverse in S 
but we know that G is not a group. 


We leave it to readers to construct a similar example of a semigroup S which has 
a left identity and right inverse for each of its elements but it fails to be a group. 


Definition 3.52 Let a and b be elements of a group G. We say that a and b are 


conjugate in G (and call b aconjugate of a) if there exists x € G such thatb = x~!ax. 


We write, for this, a ~con; b and refer to this relation as conjugacy. 
Theorem 3.53 Conjugacy is an equivalence relation on G. 


Proof Since a = e—lae, for each a € G, it follows that a ~Conj A; 1.€., ~Conj 18 
reflexive. 

If a ~conj b, then there exists x € G such that b = x~!ax. Hence, a = (x~!)7! 
bx~!. Since x~! € G, it follows that b ~¢onj 4, 1-€., ~Conj iS symmetric. 

Suppose that a ~¢onj b and b ~conj c, where a, b,c € G. Then, b = x~ ‘ax and 
c = y 'by, for some x, y € G. Substituting for b in the expression for c we obtain 
c= y !x~laxy. This shows that c = (xy)~!a(xy). Since xy € G, it follows that 
4 ~Conj Cs 1-€., ~Conj 18 transitive. |_| 


1 


Theorem 3.53 shows that we can divide a group G into equivalence classes under 
the relation of conjugacy. Each such equivalence class is called a conjugate class. 


Exercises 


1. Prove that the set of all rational numbers of the form 36”, where m and n are 
integers, is a group under multiplication. 
2. LetG = {a € R| —1 <a < ]}. Define a binary operation * on G by 


for all a, b € G. Show that (G, x) is a group. 


3.4 


1S) 


IAN a 


(oe) 


\o 


10. 
11. 


12. 
13. 


. Show that the equation x 
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. Let G be the set of all rational numbers except —1. Show that (G, «) is a group 


where 
axb=a+b+ab, 


foralla,beG. 


. If a and b are elements of a group G, prove that ab"a~! = (aba~')", for each 


integer n. 


. Show that if every element of the group G is its own inverse, then G is abelian. 
. If G is a group of even order, prove it has an element a  e satisfying a” = e. 
. Prove that a finite semigroup G with identity is a group if and only if G contains 


only one idempotent. 


. Let G be a group. 


(a) If G has three elements, then show that it must be abelian; 
(b) Do part (a) if G has four elements; 
(c) Do part (a) if G has five elements. 


. Consider the set {0, 1, 2, 3, 4, 5, 6, 7}. Suppose that there is a binary operation 


* on G that satisfies the following two conditions: 


(a) axb<a+t+b,foralla,beG; 
(b) axa = 0, for alla € G. 


Construct Cayley table for G. (This group sometimes called Nim group). 

Find an example which shows it is impossible to have (ab)~* 4 b~7a~?. 

In a finite group, show that the number of non-identity elements that satisfy the 
equation x° = e is a multiple of 5. If the stipulation that the group be finite is 
omitted, what can you say about the number of non-identity elements that satisfy 
the equation x° = e? 

For each positive integer 1, prove that the number of groups of order 7 is finite. 
Let G be a group in which (ab)" = a"b” for some fixed integer n > 1 and for 
alla, b € G. For all a, b € G, prove that 


(a) (ab)""! = b'lgr!, 
(b) ab"! — b"-!q"; 
(c) (abag!b7!)""-) = e, 


. In Theorem 3.46 show by an example that if one just assumed one of the can- 


cellation laws, then the conclusion need not follow. 


. Show that in Theorem 3.46 infinite examples exist, satisfying the conditions 


which are not groups. 


2ax = a~' has a solution for x ina group G if and only 


if a is the cube of some element in G. 
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3.5 Turning Groups into Latin Squares (Optional) 


The name “Latin square” was inspired by mathematical papers by Leonhard Euler 
(1707-1783), who used Latin characters as symbols, but any set of symbols can be 
used. 


Definition 3.54 A Latin square is ann x n table filled with n different symbols in 
such a way that each symbol occurs exactly once in each row and exactly once in 
each column. 


Example 3.55 The following are two examples of Latin squares. 


a|b|c|dle 
1} 2] 3 bl a|e|c|d 
3} 1 c|d|b|ela 
3;} 1] 2 d\|e|\a|bie 
e|c|djal|b 


Theorem 3.56 Cayley table of any finite group is a Latin square. 


Proof Take a row indexed by the group element a. Suppose that two elements in this 
row are equal. In other words, there are two columns c and d such that ac = ad. If 
we multiply by a~! on the left, this gives us c = d, i.e., these were in fact the same 
columns, and therefore, there are no repetitions in this row. The same logic tells us 
that there are also no repetitions in any column; therefore, this is a Latin square. Mi 


A natural question is: which Latin squares are Cayley tables of groups? 

Many of the group axioms for the operation - of a finite group can be checked by 
referring to the Cayley table for the group. From the table, one can quite easily check 
the existence of an identity element and observe that each group element occurs 
exactly once in each row and each column. The only group property that is difficult 
to check directly from the properties of the Cayley table is associativity. It must be 
shown that (ab)c = a(bc) for all elements a, b, and c of the group, and to check this 
property case by case is quite tedious. If the group contains n elements, there are 
n> ordered triple (a, b, c) that must be considered with (ab)c and a(bc) checked for 
equality in each case. 

The following theorem may simplify the verification of the associative law for 
multiplication. 


Theorem 3.57 Let G be a finite group. Let r and s be in the array representing 
the Cayley table for G such that the column containing r and the row containing s 
intersect at the identity element e. Then, rs is the element at the fourth corner of the 
rectangle formed by r,s, and e. 


Proof The identity element e occurs exactly once in each row and in each column, 
so suppose that e of the theorem is in the row headed by q and in the column headed 
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by b. Then, r occurs exactly once in the column headed by J, say in the row headed 
by p; sos occurs exactly once in the row headed by q, say in the column headed by 
a. 

-|a 


p 
q 


axl o 


5 


Thus, pb =r, qa =s, and gb = e. We wish to show that the fourth corner pa of 
the rectangle formed by r, s, and e is rs. We have 


rs = (pb)(qa) = p(bq)a. 


The last equality is a result of the associativity law present in the group G. However, 
qb is the identity; thus bq is the identity element. 
It follows that rs = pea = pa. a 


The rule stated in Theorem 3.57 is called the rectangle rule. 

Since we wish to emphasize that the rectangle rule follows from the associative 
law, Theorem 3.57 could be stated as follows: 

Suppose that G is a system with identity element e such that each group element 
occurs exactly once in each row and in each column of the Cayley table; then the 
rectangle rule is a consequence of Gs having the associative property. 

Suppose that we number the rows and columns from | to n and let a;; be the 
element in the ith row and jth column. Suppose that r = aj, and s = aj;;. Then, 
pa = Gj; Since pa is the same row as r in the same column as s. Also, e = a jx since 
e is in the same column as r and in the same row as s. The rectangle rule pa = rs 
now becomes the following: 

If e = ajx, then aj; = aj;xaj;, for alli and t. The table 


ab 
p\rs r 
q\s e 


now takes the form 


P| Git Gik 
4 |4jt Ajk 


Theorem 3.58 Suppose that G is a system with identity element e and with the 
following properties: 


(1) The Cayley table is a Latin square. In other words, each element of G occurs 
exactly once in each row and in each column; 
(2) The rectangle rule holds. 


Then, the operation obeys the associative law. 
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Fig. 3.6 Rectangle A rs ————___ r 
A 
8 e 
Fig. 3.7 Rectangle B sq ————_____-. 5 
B 
a e 
Fig. 3.8 Rectangles A and rs ———_____ r 
B together 
A 
sa CS CO e 
B 
a e 
Fig. 3.9 Rectangles A B Oe _ppg Sse 
and C together 
C A 
sa —————_———_- 5 ———_ 
B 
a e 


Proof Let r,s, anda be elements of G. By (1), e occurs in each column, so pick an 
e in the table. By (1), we can locate the r in the column containing e and the s in 
the row containing e. By (2), we know that the element in the fourth corner of the 
rectangle A is rs, see Fig. 3.6. 

By (1), we can find the e in the column containing the above mentioned s. By (1), 
a can be located in the row containing this e. By (2), the element in the fourth corner 
of rectangle B is sa, see Fig.3.7. Rectangles A and B fit together to form Fig. 3.8. 
We see that a new rectangle C is formed with three of the corners labeled sa, s, and 
rs, see Fig.3.9. By (2), the element in the fourth corner of rectangle C union A is 
r(sa). By (2), the element in the fourth corner of rectangle C union B is (rs)a. But 
these corners are the same. Consequently, we obtain r(sa) = (rs)a. a 


Example 3.59 Consider the following table: 
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Fig. 3.10 Rectangle related d 
to Example 3.59 


a 


e 


b 


e 


a 


No FAQ alo 
SFAaa nr SI 
nk an nanwe 
aa aeatia 
area a gala 


e 
a 
b 
c 
d 


This table is a Latin square, but rectangle rule does not hold as bb ¥ d in the rectangle 
in Fig. 3.10. 

The operation does not have the associative property, because (ab)c = a and 
a(bc) =c. 


Another method of verifying the associative law is to use permutations. We shall 
study this method in the future. 


Exercises 


1. Show that the number of n x n Latin squares is 1, 2, 12,576 forn = 1, 2, 3, 4, 
respectively. 
2. Which of the following Latin squares are Cayley tables of groups? 


MO) BG] RO] Re 
BY] WwW) rR] MN] be 
mm! Un] bdo} BY] 
we] RO] Mf Re} 
No] Re] BY] Wo] nm 
ny) BW) RD] Re 
WW} te} BY] bo 
BY] DR] HR] 
Re |W] do] ny & 
NO] Re] BR] Go] Nn 


3. Suppose the following is the Cayley table of a group G. Fill in the blank entries. 


abcd 


a Fa a 
| 
Qao0 & | 
QX 
o 
| 


XQ 
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3.6 Subgroups 


In general, we shall not be interested in arbitrary subsets of a group G because they 
do not reflect that G has an algebraic structure imposed on it. So, we are interested 
in special subsets of a group, called “subgroups”. The concept of subgroup is one of 
the most basic ideas in group theory. 


Definition 3.60 A non-empty subset H of a group G is called a subgroup of G if 
relative to the operation in G, H itself forms a group. 


Let e be the identity element of a group G. Then, trivially G and {e} are subgroups 
of G. These subgroups are called trivial subgroups of G. A subgroup H of G is called 
a proper subgroup if it is different from G as well as from {e}. We denote the relation 
“H is a subgroup of G” by H < G. 

It would be useful to have some criterion for deciding whether a given subset of 
a group is a subgroup. 


Theorem 3.61 (Two-Step Subgroup Test) A non-empty subset H of the group G is 
a subgroup if and only if 


(a) a,b € H implies ab € H; 
(b) a € H implies am! € H. 


Proof If H is a subgroup of G, then it is obvious that (1) and (2) must hold. 
Conversely, suppose that H is a subset of G for which (1) and (2) hold. In order 
to establish that H is a subgroup, all that is needed is to verify that e € H and that 
the associative law holds for elements of H. Since the associative law does hold for 
G, it holds all the more so for H, which is a subset of G. Now, if a € H, then by (2), 
a~' € H. Next, since a € H anda™! € H, by (1), it follows that e = aa~! € H. 
This yields that H is a subgroup of G.. a 


Theorem 3.62 (One-Step Subgroup Test) A non-empty subset H of the group G is 
a subgroup if and only if 


a,b € Himplies ab! eH. 


Proof Let H beasubgroup of G and leta, b € H. By Theorem 3.61 (2), we conclude 
that b—' € H.Now, sincea, b~'! € H, by Theorem 3.61 (1), it follows thatab~! € H. 

Conversely, let H be a non-empty subset of G such that ab! € H, foralla,b € 
H. Since H is non-empty, it follows that there exists x € H. So,e =xx! eH. 
Now, suppose that a and b are two arbitrary elements of H. Asa,e €¢ H,ea"'! € H, 
i.e.,a~! € H.Next, since a, b € H, it follows that b~! € H anda(b7!)~! € H. This 
implies that ab € H. Therefore, by Theorem 3.61, we conclude that H is a subgroup 
of G. a 


Theorem 3.63 (Finite Subgroup Test) Jf H is a non-empty finite subset of a group 
G, then H is a subgroup of G if 
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Proof In light of Theorem 3.61, we need to prove that a~! € H whenever a € H. 
Assume that a is an arbitrary element of H. If a =e, then a~'! =a and we are 
done. If a # e, then the set {a, a*, a>,...} is a subset of H. Since H is finite, 
there must be repetition in this set of elements, i.e., for some positive integers i 
and j, a =a/ withi > j > 0. Then, by the cancellation law in G, a'-i =e, and 
sincea # e,i — j > 1. Hence, we can write a'~/ = aa'~/—! = e, and consequently 
a'-J-! = a“! Buti — j — 1 => 1 implies a'~/~! € A and we are done. | 


Example 3.64 Let R be the group of all real numbers under addition, and let Z be 
the set of all integers. Then, Z is a subgroup of R. 


Example 3.65 Let R* be the group of all non-zero real numbers under ordinary 
multiplication, and let Q* be the set of positive rational numbers. Then, Q* is a 
subgroup of R*. 


Example 3.66 Let C* be the group of all non-zero complex numbers under multi- 
plication, and let H = {a + bi | a + b* = 1}. Then, H is a subgroup of C*. 


Example 3.67 Let G be anabelian group. Then, H = {x € G|x” = e} isasubgroup 
of G. 


The following remark is clear. 


Remark 3.68 If H is a subgroup of G and K is a subgroup of H, then K is a 
subgroup of G. 


Example 3.69 Suppose that H = {1, —1, i,, —i} and K = {1, —1}, where z= 
—1. Then, H is a subgroup of C*, the group of non-zero complex numbers under 
multiplication, and K is a subgroup of H. So, we can say that K is a subgroup of 
Cc, 


Example 3.70 Consider the group of integers Z with ordinary addition. Then, the 
set 
nZ = {..., —3n, —2n, —n, 0, n, 2n, 3n, ...} 


for each positive integer n is a subgroup of Z. 


Example 3.71 Let G be the group defined in Example 3.41. Let H be the subset 
of G consisting of all 2 x 2 matrices ¢ “| such that ad — bc = 1. Then, as easily 
verified, H is a subgroup of G. 


Example 3.72 Let C). = {e = a°, a!, a’, ..., a!'}. We define a binary operation 
on Cj as follows: _ 
iad = av ifi+j<12 
a {Gs fej 12, 
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Fig. 3.11 Hasse diagram for Cis 
the family of subgroups of 
ri a % 
My A, 
A 4 
{e} 


It is easy to see that Cj, is a group under the above multiplication. Moreover, all of 
the subgroups of Cj, are as follows: 


{e}, G, 
Hy, = {e, a*, a*, a®, a’, a!) 
Hy = {e, a®, a®, a®}, 

H; = {e, a*, a®} 

Ay = {e, a}. 


In Fig.3.11, we illustrate Hasse diagram for subgroups of Cj2. 


Example 3.73 Let G be any group and H be a subgroup of G. Fora € G, let 
a-'Ha = {a7'ha|h € H}. 


We assert that a~! Ha is a subgroup of G. Let x and y be two arbitrary elements of 
a~'Ha. Then, x = a~'ha and y = a~'h’a, for some h, h' € H. Thus, we obtain 


xy7! = (a'ha)(a7'h'a)~! = (aha) (a7'h'"'a) = a7'hh'""a. 


Since h, h’ € H and H is a subgroup of G, it follows that hh'"' € H. This shows 
that xy~! € a~'Ha, and so a~'Ha is a subgroup of G. 


Theorem 3.74 [If G is a group and {H; |i € I} is anon-empty family of subgroups 
of G, then 

(1) Hi 

iel 
is a subgroup of G. 
Proof Suppose that a and b are two elements of (| H;. Then, a,b € Hj, for 

iel 

all i € I. Since for each i € J, H; is a subgroup of G, it follows that ab € H;,, 
for all i € J. Hence, we conclude that ab“! € ()) H;. This shows that () H; is a 


iel iel 


subgroup of G. a 
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In particular, the intersection of two subgroups of a group is a subgroup. The 
following example shows that the union of two subgroups of a group is not a subgroup, 
in general. 


Example 3.75 Suppose that G = Z. We know that H; = 2Z and H2 = 3Z are sub- 
groups of G. We have 


2¢€H, C H,U Aj and3€ WC A,U I, 


while 2+ 3 = 5 ¢ Hy U Ao. So, H; U Hp cannot be a subgroup of G. 


Definition 3.76 (Center of a Group) The center Z(G) of a group G is the subset of 
elements in G that commute with every element of G, i.e., 


Z(G) = {a € G| ax = xa, for all x € G}. 


Theorem 3.77 The center of a group G is a subgroup of G. 


Proof We use Theorem 3.61 to prove this result. Since e € Z(G), it follows that 
Z(G) is non-empty. Now, assume that a and b be two arbitrary elements of Z(G). 
Then, we have 


(ab)x = a(bx) = a(xb) = (ax)b = (xa)b = x(ab), 


for all x € G. Thus, ab € Z(G). Also, we have 


xa! = exa~! = (a~!a)xa! = a7! (ax)a~! = a7!(xa)a7! 


=a7'x(aa“!) =a7!xe =a™'x, 


for all x € G. This shows that a~! € Z(G) whenever a € Z(G). | 
Corollary 3.78 If G is an abelian group, then Z(G) = G. 


Definition 3.79 Let X be a fixed non-empty subset of a group G. The centralizer 
of X in G, Cg(X), is the set of all elements in G that commute with elements of X, 
1.€., 

Cg(X) = {a € G| ax =xa, for all x € X}. 


Theorem 3.80 The centralizer of anon-empty subset of a group G is a subgroup of 
G. 


Proof The proof is similar to the proof of Theorem 3.77 and it is left to the reader. 
a 


If X = {x}, then we write Cg(x) instead of Cg({x}). In this case, Cg (x) is the 
set of all elements in G that commute with x. 
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Corollary 3.81 We have 


Z(G) = ( Cg (x). 


Proof It is straightforward. | 


Theorem 3.82 Let H, and Hy be two subgroups of G. Then, H, U Hp is a subgroup 
of G if and only if H, © Hy or Hy © Aj. 


Proof If H, © H> or Hy C Hy, then it is clear that H, U H> is a subgroup of G. 
Conversely, let H,; U Hz be a subgroup of G. We prove by contradiction method. 
Suppose that H; Z Hy and H> ¢ Hj. Then, there exist x € H; — H and y € Hy — 
Hy. So, we conclude that x~! € My, y~! € Hy and x, y € Hy U Mp. Since H, U Hy 
is a subgroup, it follows that xy € H, U HM. This implies that xy € Hj orxy € Hy. If 
xy € M,, then y = x~!(xy) € Mj, and this is a contradiction. Similarly, if xy € H2, 
then x = (xy) y—! € Hb, and this is again a contradiction. Therefore, we deduce that 


A, C Ab or Ao C Aj. |_| 

Corollary 3.83 If {H; | i € I} is a chain of subgroups of a group G, then |) H; is 
ie] 

a subgroup of G. 

Exercises 


1. If Gis an abelian group and if H = {a € G| a* = e}, show that H is a subgroup 
of G. 

2. Give an example of a non-abelian group for which the H in Exercise | is not a 
subgroup. 

3. Let G be an abelian group, fix a positive integer n. Let H = {a” | a € G}. Show 
that H is a subgroup of G. 

4. In Example 3.43, let H = {T,,, € G | a is rational}. Show that H is a subgroup 
of G. 

5. If H is a subgroup of G, let 


N= ()x'Hx. 
xeEG 


Prove that N is a subgroup of G such that a~'Na = N, for alla € G. 

6. If H is asubgroup of G such thata~'Ha C H, foralla € G, prove that actually 
a'Ha=H. 

7. Let H and K be two subgroups of G such that a~!'Ha = H anda-'ka= K, 
for alla e G. If HN K = {e}, prove that hk = kh, for anyh €¢ Handke K. 

8. For a non-empty subset X of a group G, define 


k 
xt={T] x |x € x}, 


i=1 
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for any positive integer k. Prove that if G has n element, then X” is a subgroup 
of G. 

9. Let G be a group, and let a € Z(G). Ina Cayley table for G, how does the row 
headed by a compare with the column headed by a? Is the converse of your 
answer true? 

10. Show that x € Z(G) if and only if Cg(x) = G. 
11. Let x, y € Gand let xy = z if z € Z(G). Show that x and y commute. 
12. If G is a group and a, c € G, prove that 


Cg(a7!xa) = a7!Cg(x)a. 
13. Let G be a group and X be a non-empty subset of G. The set 
Ng(X) = {ae G|a!Xa =X} 


is called the normalizer of X in G. If H is a subgroup of G, prove that 


(a) Ng(#) is a subgroup of G; 
(b) H < NG(#). 


14. Give an example of a group G and a subgroup H such that Ng(H) 4 Cg(#). 
Is there any relation between Ng(H) and Cg(H)? 

3.7 Worked-Out Problems 

Problem 3.84 Let G be the group defined in Example 3.41. Find the center of G. 


Solution Suppose that i i is an arbitrary element in the center of G. So, this 


1 . 
1 5 € G, it follows that 


ab ; Ol; |01 ab 
cd 10] |10 cd’ 
ae € G, it follows that 
01 , 


fea} [or}=[oi] [ea] 


aa+b) _ |a+bb+a 
bb+a|— b a : 


. ; . 0 
matrix commutes with all elements of G. Since 


and so a = d and b = c. Similarly, since 


Hence, we get 
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This implies that b = 0. Therefore, each element of the center of G is in the form 


[oe]: 


where a # 0. On the other hand, it is easy to check that for each non-zero real number 


a, the matrix i a belongs to the center of G. Consequently, we have 


z6)={| G4] |@eR #0}, 


and we are done. |_| 


Problem 3.85 Let S = N° x N°. On S we define a binary operation 
(a, b) x (c,d) = (a —b+max{b,c},d-—c+max{b, Gy: 


Prove that (S,*) is a monoid with identity (0,0). This monoid is called Bicycle 
semigroup/monoid. 


Solution Suppose that (a, b), (c,d) € S. Then, we have max{b,c}— b> 0 and 
max{b, c} — c => 0. So, we get a — b+ max{b, c} > a andd —c + max{b, c} > d. 
Consequently, (a — b + max{b, c},d —c + max{b, c} € S, and hence the multipli- 
cation is closed. Clearly, (0, 0) € S and for any (a, b) € S we have 


(0, 0) x (a, b) = (0 — 0+ max{0, a}, b —a + max{0, a}) 
= (a, b) = (a, b) x (0, 0). 


Hence, (0, 0) is the identity element of S. Now, we verify the associativity. Assume 
that (a, b), (c, d) and (e, f) are arbitrary elements of S$. Then, we have 


((a, b) «(c,d)) «(e, f) 

= (a —b+ max{b, c},d —c + max{b, c}) x (e, f) 

= (a—b—d+c+max{d —c+max{b, c}, e}, 
f —e+max{d —c+max{b, c}, e}, e}) 


and 
(a, b) x ((c, d) * (e, f)) 
= (a, b) « (c —d + max{d, e}, f — e + max{d, e}) 
= (a — b + max{b, c — d + max{d, e}}, 
f-—e-c+d+max{b,c—d-+max{d, e}}). 


Now, we must show that 
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a—b—d+c+max{d —c+max{b, c},e} =a—b-+max{b,c —d-+max{d, e}}, 
(3.2) 
and 
f —e+max{d —c+max{b,c},e} = f —e—c+d+max{b,c —d-+max{d, e}}. 
(3.3) 
Equations (3.2) and (3.3) are the same, and so we only need to show that 


—d+c-+max{d—c-+max{b, c}, e} = max{b,c — d+ max{d, e}}. 
But this equality is equivalent to 
max{max{b, c},c —d +e} = max{b,c — d + max{d, e}}, 
and this equality holds, because 
max{b, c— d + max{d, e}} = max{b, max{c -d+d,c—d-+e}} 

= max{b, max{c,c —d+e}} 
= max{b,c,c—d+e} 
= max{max{b, c},c —d +e}. 

Therefore, the multiplication is associative and hence S is a monoid. a 


Problem 3.86 Let G bea group and ag € G. Define a new binary operation * on G 
such that (G, *) is a group with apo as its identity. 


Solution We define * on G as follows: 
x*y= x5 Y, 


for all x, y € G. Let x, y, and z be arbitrary elements of G. Then, we can write 


(x * y) &z = (xapy) * z= xa yay 'z = xaq '(yaq‘z) 
= x * (yay 'z) = x * (y *2). 


Hence, the associative law holds. 

Foranyx € G,x * day = xdp |ao = xe = x, where ¢ is the identity of G. Similarly, 
ay * X = apd 'y = ex =x. This shows that do is the identity element of G with 
respect to x. 

Next, let x € G. Then, we have x * (agx~!ag) = Xap ‘aox!ao = dg. Also, we 
have (aox~!ag) * x = agx~!agag |x = ao. Hence, ayx~'ag is the inverse of x with 
respect to x. 

Therefore, (G, *) is a group with ap as its identity. a 


Problem 3.87 Let G be a group and m, n be relatively prime positive integers such 
that 
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a™b™ = b™a™ and a"b" = b"a". 
for all a, b € G. Prove that G is abelian. 


Solution Since (m,n) = 1, it follows that there exist integers x and y such that 
mx +ny = 1. Then, we have 


(a™bry® = (a™b" Cab")... (a™b") 

a” (G"a™)@"a™) = (b"a"))b" 
— a™(b"a™)"* (b"a’”)—!b” 

a™ (bay q—-™b-"p” 
=a™(b"a")™aq-™ 

ae aay a” 

— (pray) aa 


= (b"a"")"*, 
This shows that 
(a™b"y"™* = (b"a”)™, (3.4) 
In a similar way, we see that 
(ab")"” = (b" a)”. (3.5) 


From (3.4) and (3.5), we obtain 
a™b” _ (a™ brynxtry _ (bam ymxrny — ba™. 
Finally, we can write 


ab = qmxtny pmxtny = qm (a” b™*)b” 
= g™ hb gry pry = pl gmt Hry gry 
_ b™* HY qi'X qny _ pinxtny qgmx+ny —_ ba. 


This completes the proof. | 


Problem 3.88 If G is a group in which 
(ab)‘ = ak bk (3.6) 
for three constructive integers k and for all a, b € G, show that G is abelian. 


Solution Suppose that (3.6) is true for n, n + 1, and n + 2, where n is an integer, 
1.€., 

(ab)” = a"b", 

(ab)"*! _ qghtlpntl 

(aby'*2 23 qt t2prt2. 
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Then, we obtain 
q?tipntl = (ab)"*} oS (ab)"ab = q"b" ab. 


By cancellation law, we get 
ab" = b"a. (3.7) 


On the other hand, we have 
at2pnt2 — (ab)"t? = (ab)"(ab)* = a"b"abab. 
Again, by cancellation law, we obtain 
a’b"*! = b"aba. (3.8) 
From (3.7) and (3.8), we have 


a*b"t! = b"aba = ab"ba = ab"*!a. 


This yields that 
abt! = p""q, (3.9) 


Now, from (3.7) and (3.9), we conclude that ab"t! = b"*+!a = bb"a = bab". This 
shows that ab = ba. a 


3.8 Supplementary Exercises 


1. Let G be the set of all real 2 x 2 matrices where ad — be # Oisarational 


ab 
cd) 
number. Prove that G forms a group under matrix multiplication. 


2. Let G be the set of all real 2 x 2 matrices where ad # 0. Prove that G 


ab 
Od)’ 
forms a group under matrix multiplication. Is G abelian? 


3. Let G be the set of all real 2 x 2 matrices k = | where a # 0. Prove that G 


is an abelian group under matrix multiplication. 

4. A semigroup S is called regular if for each y € S there exists a € S such that 
yay = y. Let S be a semigroup with at least three elements and x € S is such 
that S \ {x} is a group. Prove that S is regular if and only if x? = x. 

5. Let x1, x2, ..., X, be elements of a group G. Prove that there are 


(2n — 2)! 
n\(n — 1)! 
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ways of evaluating the product of x1, x2, ..., X, in that order. All these ways 
yield the same group element which can thus be denoted unambiguously by 
X1XQ...Xy. 


. Let G be a finite group. Show that there are an odd number of elements x of G 


such that x? = e. Show that there are an even number of elements y of G such 
that y? Ae. 


. Let G be a group with an odd number of elements. Prove that for each a € G, 


2 


the equation x“ = a has a unique solution. 


. Show that the conclusion of Problem 3.88 does not follow if we assume the 


relation (ab)* = a*b* for just two constructive integers. 


. Let G be a group, a, b € G and m, n be two integers. 


(a) If b-'ab = a", show that b~!a"b = a" and b-"ab" =a"; 
(b) Ifa~'b*a = Bb and b~!a?b = a’, prove that a = b = e. 


Let G be the set of all positive rational numbers of the form 


ath 
c2 +d?’ 


where a, b, c, and d are integers. Is G a group, being ordinary multiplication? 


. A group G satisfies the ascending chain condition (ACC) on subgroups if every 


ascending sequence 
A, <Wb<... 


of subgroups must eventually be constant, that is, if there is an n > 0 such 
that H,+, = H, for all k > 0. A group G satisfies the maximal condition on 
subgroups if every non-empty family of subgroups has a maximal member. 
Prove that a group G satisfies the maximal condition on subgroups if and only 
if it satisfies the ascending chain condition on subgroups. 


. Let G be a group of order n and let m be an integer relatively prime to n. Show 


that if x” = y”, then x = y. Hence show that for each z € G there is a unique 
x € G such that x” = z. 


Chapter 4 ®) 
Cyclic Groups cies 


The simplest type of groups are group of integers modulo n and cyclic groups. 
Cyclic groups are groups in which every element is a power of some fixed element. 
In this chapter, we examine cyclic groups in detail and determine their important 
characteristics. We observe that a cyclic subgroup with generator a is the smallest 
subgroup containing the set A = {a}. Can we extend subgroups generated by sets 
with more than one element? We answer this question in this chapter too. 


4.1 Group of Integers Modulo n 


Let a, b € Zand n be a positive integer. Already, we defined 
a=b(modn) ©} nila—b. 


This relation is named as congruence modulo n. In Lemma 1.62, we proved that this 
relation is an equivalence relation. The equivalence class of a € Z is the set 


a= {be Z|a=b(modn)}. 


The integers modulo n partition Z into n different equivalence classes. We denote 
the set of these equivalence classes by Z,,. Thus, 


Y210. Ici 1h 
For instance, if n = 5, then 
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2021 87 


B. Davvaz, A First Course in Group Theory, 
https://doi.org/10.1007/978-98 1- 16-6365-9_4 


88 4 Cyclic Groups 


+10hr = 


Fig. 4.1 5+10=3 


OS {si =15>=10,-=5,. 0, 5, 10, 15;..), 
1S{...,-14, 0, =4, 1,6, 11, 16,...3, 
D2. 219 28, 9. 27. Pe 17 
G1. 1d 7 0 a eS: eA 
AS... “1, 6. 1, A @ 14, 10-32. 


and 


%= 0, 1, 2, 3, 4); 


Theorem 4.1 (Z,,, +) is an abelian group, where + is the addition modulo n, i.e., 
a+b=a+b, foralla,b € Z. 


Proof First we show that + is well defined on Z,,. If @ = Gy and b, = by, then there 
exist integers g; and q2 such that a; — az = qin and by — bz = qzn. So, we have 


(a, + b)) — (2 + bg) = ay — an + by — bg = Qyn + Gon = (qi + qo)n. 


This implies that a; + bj = a2 + b2(mod n), or equivalently aq, +b, =a,+)ho. 
Clearly, addition on Z, is associative and commutative. 0 is the additive identity 
for Z,, and each a € Z,, has an additive inverse —a in Z,,. Note that —a =n —a.@ 


Sometimes, when no confusion can arise, we use 0, 1,...,m — 1 to indicate the 
equivalence classes 0, 1,...,” —1. 


Example 4.2 Forn = 12, we have3+6=9,8+9=5,6+7=1and5+10= 
3. A familiar use of modular arithmetic is in the 12-hour clock (see Fig. 4.1, in which 
the day is divided into two 12-hour periods). 


Example 4.3 The Cayley table for (Zs, +) is as follows: 
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We can also multiply elements of Z,,. Given two elements a and bin Z,,, we define 


a-b=ab. 


This is well defined and with respect to it, Z, becomes a commutative monoid. 

A multiplicative inverse for a € Z, is an element b € Z,, such that a@-b = 1. An 
element a € Z, is a unit if it has a multiplicative inverse in Z,. In other words, the 
integer a is a unit modulo n, meaning that ab = 1(mod n) for some integer b. 

By the above operation we do not obtain a group. For instance, 0 does not have a 
multiplicative inverse. 


Lemma 4.4 @ is a unit in Z,, if and only if (a, n) = 1. 


Proof Ifaisaunit, thenab — 1 = gn forsome integers b andqg,andsoab + qn = 1. 
Now, by Theorem (1.51) we conclude that (a, n) = 1. 

Conversely, if (a,n) = 1, then by Theorem (1.51) there exist x, y € Z such that 
1 = ax + by. Thus, x is a multiplicative inverse of a. a 


Let U,, denote the set of units of Z,. 


Theorem 4.5 For each integer n > 1, the set U, forms an abelian group under 
multiplication modulo n, with identity element 1. 


Proof We first show that the product of two units @ and b is also a unit. If @ and b 
are units, then they have inverses c and d, respectively, such that 


a-c=ac=1andb-d=bd=1. 


Hence, we obtain that 


ab-cd =abcd = achd =ac-bd=1-1=1. 


Consequently, ab has the inverse cd, and is a unit. Moreover, for each a, b,  € Un, 
we have 
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Table 4.1 A short table of values g(n) 
n 1 2 3 4 5 6 7 8 9 10; 11} 12} 13 | 14} 15 
g(n)| 1 1 2 2 4 2 6 4 6 4 10} 4 12 | 6 8 


The identity element of U,, is 1, since @-1 =a =1-G, forall ae Uj. Finally, if 
a € U,, then by the definition of U,,, there exists c € Z, such thata-¢c = 1=C-@. 
This yields that c € U,, and c is the inverse of a. a 


Definition 4.6 Let n be a positive integer. The Euler function ¢ is defined to be the 
number of positive integers not exceeding n which are relatively prime to n. 


Table 4.1 is a short table of values y(n): 
Corollary 4.7 For each integer n > 1, |U,| = y(n). 
Proof It follows from Lemma 4.4 and Theorem 4.5. a 


Example 4.8 In Z 4, the group of units is Uj4 = {1, 3,5, 9, 11, 13}. The operation 
in Uj, is multiplication modulo 14. The following is Cayley table for Uj4: 


13 5 9 1 13 
17/1 3 5 9 I 13 
3);3 9 1 13 5 II 
5}5 1 11 3 13 9 
9;9 13 3 11 1 5 
1Wjll 5 13 1 9 3 
13}13 11 9 5 3 #1 


Example 4.9 If p is prime, then all positive integers smaller than p are relatively 
prime to p. Hence, Up = {1, 2,..., p— 1}. 


Theorem 4.10 Let p > 1 be an integer. Then the following statements are equiva- 
lent. 


(1) p is prime; 7 
(2) Ifa-b=0inZ,, thena =0orb=0. 
Proof (1 => 2) Since p is prime, it follows that U, = Z, — {0} is a group. Assume 


that a -b = 0 in Z,. If a = 0, then there is nothing to prove. Ifa # 0, then @ € Up. 
So, there exists x € U, such that x -a = 1. Therefore, we get 


0=x-0=x-G@-b)=(-@)-b=1-b=b. 
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(2 = 1) Suppose that p = ab for s some integers a and b. We show a = £1 or +p. 
Since p = ab, it follows that ab = p = 0. This implies that a -b = 0 in Zp. Now, 
by (2) we conclude that a = 0 orb = a If a = 0, then pla, or scnitvalently ¢ a=kp 
for some integer k. Hence, we have p = ab = kpb. This implies that 1 = kb. Since 


k and b are integers, it follows that the only possibilities are b = +1, and soa = +p. 
If b = 0, then a similar argument shows that a = +1. Therefore, p is prime. | 
Exercises 

1. Solve equation x? +4x+8=0inZj). 


2. Find the number of distinct solutions to the equation x? + (—1) = 0 in Zs, Z 3, 
and in Zo. 
. Find, if possible, a multiplicative inverse for 8 in each of Zs, Zo1, and Zora. 
4. Find a positive integer n and three elements @, b, and b in Z, such that none of 
a-b,b-¢,andZ- a is equal to 0, yeta: b-c = 0. 
5. Given a, b € Zn, we say that b is a square root of aif b-b =a. 


io) 


(a) Find all square roots of elements in Z7, if exist; 

(b) If p is any prime, show that a has at most two square roots modulo n; 

(c) Give an example that shows that it is possible for a number to have more than 
two square roots. 


4.2 Cyclic Groups 


This section contains a few observations about cyclic groups. The structure of cyclic 
groups is relatively simple. We examine cyclic groups in detail and determine many 
of their important characteristics. 

If G is a group, a € G andk € Z, then we define 


aa...a ifk>0 
—__-— 
k times 

a"=e ifk=0 


aa!...a7' ifk <0. 
$$, 


—k times 


Lemma 4.11 /fG is a group anda € G, then H = {a" |n € Z} is a subgroup of G. 


Proof It is straightforward. a 
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Fig. 4.2, An examples of 
string art to show that 5 is a 
generator of Z1 


Definition 4.12 Let G be a group. A subgroup AH of G is cyclic if H = {a" |n € Z}, 
for some a € H. In this case we say that H is the cyclic subgroup generated by a. 


When this happens, we write H = (a). 


Definition 4.13 A group G is called cyclic if there is an element a € G such that 
G = (a). Such an element a is called a generator of G. 


Example 4.14 The set of integers under addition is an example of an infinite group 
which is cyclic and is generated by both | and —1. 


Example 4.15 If K4 = {e, a, b, c} is the Klein’s 4-group, then H = {e, a} isa 
cyclic subgroup of K4; however, K4 is not a cyclic group. 


Example 4.16 The set Z, = {0,1,...,n— 1} for n > 1 is a cyclic group under 
addition modulo n. Both | and —1 are generators. 


Depending on which n we are given, Z, may have many generators. 


Example 4.17 Z). = (1) = (5) = (7) = (11). To verify for instance that Z). = (5) 
we can use a string art. Figure4.2 is an example of string art that illustrates how 5 
generates Z12. Twelve tacks are placed along a circle and numbered. A string is tied 
to tack 0 and then looped around every fifth tack. As a result, the numbers of the 
tacks that are reached are exactly the ordered numbers of 5 modulo 12. Note that if 
every seventh tack were used, the same artwork would be obtained. If every third 
tack were connected, as in Fig. 4.3, the resulting loop would only use four tacks, and 
so 3 does not generate Z 2. 


Example 4.18 An nth root of unity is a complex number z which satisfies the equa- 
tion z” = 1 for some positive integer n. Let w, = e?7'/" be an nth root of unity. All 
the nth roots of unity form a group under multiplication. Itis acyclic group, generated 
by @,, which is called a primitive root of unity. The term “primitive” exactly refers to 
being a generator of the cyclic group, namely an nth root of unity is primitive when 
there is no positive integer k smaller than n such that ok = 1. See Fig.4.4 forn = 5. 
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Fig. 4.3, An example of string art to show that 3 is not a generator of Z12 


y 


2 
Fig. 4.4 5th roots of unity, where a = = 


Definition 4.19 The order of an element a in a group G is the least positive integer 
n such that a” = e. In additive notation this would be na = 0. If no such integer 
exist, then we say a has infinite order. The order of an element a is denoted by o(a). 


Note that the critical part of Definition 4.19 is that the order is the least positive 
integer with the given property. The terminology order is used both for groups and 
group elements, but it is usually clear from the context which one is considered. 
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So, to find the order of an element a we need only compute the series of products 
a, a’, a°,... until we first reach the identity. If the identity never appears in the 


series, then a has infinite order. 


Definition 4.20 A torsion group is a group all of whose elements have finite order. 
On the other hand, a group is said to be torsion-free if all its non-identity elements 
have infinite order. 


Lemma 4.21 Leta 4 e be an element of a group G. Ifn is the order of a and ak = e 
for some integer k, then n divides k. 


Proof By the Division algorithm, there exist integers g andr such that k = gn+r 
and r is strictly smaller than n. So, we have 
e=aq’ =a" = aq". 

Since a has order n, it follows that a"? = (a")? = e. This implies that a’ =e, 
which contradicts our choice of n as the smallest positive integer for which a power 
of a is the identity, unless r = 0. In this case k = ng and we conclude that n must 
divide k. a 


Theorem 4.22 Ifa € G has order n, then 


n 


(n,k) 


o(a’) = 


Proof Suppose that (n, k) = d. Then, there exist integers u and v such that k = ud, 
n = vd and (u, v) = 1. Now, we have 


(a’)” = (a'?)” = qtvd =q"=— (a")" =e. 


Moreover, if (a*)” = e,thenn |km. So, vd|udm, or equivalently v|um. Since (u, v) = 
1, by Euclid’s lemma we conclude that v|m. Therefore, we obtain o(a*) = n/d. 
We now turn to some results about cyclic groups. 


Theorem 4.23 Every cyclic group is abelian. 


Proof Suppose that G is a cyclic group. By definition, there exists a € G such that 
G = (a). We need to show that xy = yx, for all x, y € G. Now, x and y are of the 
form x” and y”, for some integers m and n, respectively. Moreover, we have 


qt = qiutm 


xy=a™"a" = =a"a" = yx, 
as desired. a 


Theorem 4.24 Every subgroup of acyclic group is also cyclic. 
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Proof Let G = (a) be our cyclic group and suppose that H is a subgroup of G. We 
must show that H is cyclic. If it consists of the identity alone, then it is trivial subgroup 
and there is nothing to prove. So, we may assume that H # {e}. We consider the set 


S={seN|a’ eH}. 


Since H is non-trivial, it follows that S is non-empty. But being a non-empty subset 
of positive integers, S must have the least positive integer, call it k. We next claim 
that H = (a*). In order to prove this claim, it suffices to let / be an arbitrary element 
of H and show that h € (a“). Since h € G = (a), it follows that h = a’", for some 
integer m. We now invoke the Division algorithm to m and k to obtain integers q and 
r such that m = kq +r, where 0 <r < k. Therefore, we have 


h=a"= qhatr = aq’. 


This implies that 


a= a k4q™ . 


As ak belongs to H, so does a~*’. By closure property of a subgroup, we know that 
a~*4qa™ = a" belongs to H. As r is strictly smaller than k, it contradicts the choice 
of k as the smallest element of S, unless r = 0. This proves that h = at =(a)tle 
(a*) = H. | 


Remark 4.25 Theorem 4.24 tells us that all subgroups of additive group Z are of 
the form nZ = {nk | k € Z}. 


Theorem 4.26 Let G be a finite cyclic group generated by a. 


(1) The generator a has finite order; 
(2) Ifo(a) =n, then the elements a‘ with k = 0,1,...,n — 1 are all distinct and 
G ={e, a, a’, ...,a"}}. 


Proof Since G is cyclic, it follows that 


G = (a) ={..., a?, a~!, e, a, a’, } 
Since G is finite, it follows that for some integers k and m we have a” = a‘. This 
implies that a’”~* = e and so a has finite order. 
If o(a) =n, then we claim that G = {e, a, a”, ...,a"~'}. We see that for 0 < 
m <k<n-—1, the elements a” and a* are distinct as otherwise a’”—* = e with 
|m — k| <n. Consequently, {e, a, a”, ...,a"~'} is a subgroup of G containing 


n elements. Now, if x is an arbitrary element of G, then x = a/, for some integer 
j. If j > 0, then applying the Division algorithm, we obtain a/ = a", where 0 < 
r<n-—1.If j <0 ands = —j, then we can make a* = a’, for some r < n and 
ai =a~ = (a‘)"! = (a')"! =a". This yields that G = {e, a, a’, ...,a"~}. 
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Therefore, we conclude that the cyclic group generated by an element of order n has 
precisely n elements. | 


Corollary 4.27 Let G be a group and a be an element of G. Then, 
o(a) = |(a)|. 


Corollary 4.28 Let G be a finite group of order n. Then, G is cyclic if and only if 
there exists an element of G of order n. 


Corollary 4.29 If G is an infinite cyclic group, then G has an infinite number of 
subgroups. 


Proof If G = (a), then (a”) 4 (a), for each integer n 4 m. a 


Theorem 4.30 Let G be a cyclic group of order n generated by a and let k be a 
positive integer. If d = (n,k), then x* and x generate the same cyclic subgroup of 
G. 


Proof If d = (n, k), we show that a‘ and a@ are both powers of each other. Since d|k, 
it follows that aX = (x“)*/4. On the other hand, since d = (k,n), it follows that d = 
kr +ns, for some r, s € Z. This implies that at = qkt+"s = gg’ = a" = (a*)’. 
This yields that a* and a“ generate the same subgroup of G. a 


Theorem 4.31 Let G be a cyclic group of order n. Then, for each m dividing n, G 
has a unique subgroup of order m, namely (a"!"). 


Proof If k = n/m, then (a*)” = (a"/")” = a" = e and no smaller positive integer 
power of a‘ could be e. So, (a*) is a subgroup of order m. Next, we show that 
(a*) is the only subgroup of order m. In order to do this, let H be an arbitrary 
subgroup of G of order m. By the proof of Theorem 4.24, we have H = (a“), 
where d is the smallest positive integer such that a“ is in H. We apply the Division 
algorithm to obtain integers g andr such thatn = dq +r and0 <r < d. Then, we 
have e = a" = a“@*" = (a*)4a". This implies that a” = (a4)~4 € H. Consequently, 
r =O and son = dq. Moreover, we have 


m =|H|={Ka%)| == 

= = =o 
Therefore, we obtain d=n/m=k, ie., H = (a‘) = (a*). This completes 
the proof. | 


Theorem 4.32 (Generators of a Finite Cyclic Group) Let G be a cyclic group of 
order n generated by a. Then, G = (a*) if and only if k and n are relatively prime. 


Proof Suppose that k and n are relatively prime. Then, we may write 1 = kx + ny, 
for some integers x and y. Then, we have 
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a= qk tny = a’ qty = a = (a’)*. 


This implies that a belongs to (a*), and so all powers of a belong to (a). Thus, we 
conclude that G = (a*) and a is a generator of G. 

Conversely, suppose that k and n are not relatively prime. Then, there exists an 
integer d > 1 such that d|n and d|k. Hence, there exist integers m and r such that 
n = md and k = dr. So, we get 


(aky" = quam =q’= (a”)" =e. 


Since d > 1, it follows that m <n. This shows that a" is not a generator of G. Ml 


Corollary 4.33 Let G be a cyclic group of order n generated by a. Then, G has 
(n) generators. 


Corollary 4.34 Let k be a positive integer. Then, k is a generator of Z,, if and only 
if k and n are relatively prime. 


Theorem 4.35 Let G be acyclic group of order n generated by a. If d is a positive 
divisor of n, then the number of elements of order d is $(d). 


Proof By Theorem 4.31, there exists exactly one subgroup of order d, say (b). 
Then, every element of order d generates (b). On the other hand, by Theorem 4.372, 
an element b* generates (b) if and only if (k, d) = 1. The number of such elements 
is p(d). | 


Theorem 4.36 Every infinite cyclic group has exactly two generators. 


Proof Suppose that a is a generator of the infinite cyclic group G. Then, we have 
G={ee55 a’, a7', e, a, a’,.. .}. Now, if a’ € G is another generator of G, 
then G = {..., ak, ak, e, ak, a... .}. Since a‘! &€G, it follows that there 
exists an integer m such that a‘+! = a‘, This implies that a*'-”+! = e. Since G is 
infinite, it follows that k(m — 1) + 1 = 0, or equivalently k(m — 1) = 1. This yields 
that k = +1. Consequently, if a is a generator of G, then another generator of G is 
only a7}. a 


For any finite group G, let 6g(d) be the number of elements of G of order d. 


Then, it is clear that 
IG| 


|G| = Y > Oc(d). 
d=1 


Theorem 4.37 Jn a finite group (not necessary cyclic), 0g(d) is divisible by p(d). 


Proof Let G be a finite group. If G has no element of order d, then g(d)|0 = 0g (d) 
and we are done. So, suppose that there is a € G such that o(a) = d. By Theorem 
4.35, (a) has p(d) elements of order d. Now, if all elements of order d belong 
to (a), we are done. Hence, suppose that x € G such that o(x) = d and x € (a). 
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Then, (x) has g(d) elements of order d too. Next, if (a) and (x) have no elements 
of order d in common, we have found 2¢(d) elements of order d. Note that if 
there is y € (a) N (x) with o(y) = d, then we obtain (a) = (x) = (y), and this is a 
contradiction. Continuing, we conclude that the number of elements of order d is a 
multiple of g(d). | 


Exercises 


—_ 


. Draw the Hasse diagram for a cyclic group of order 30. 

2. Show that the elements of finite order in an abelian group G form a subgroup of 
G. 

3. Let U24 be the group of invertible elements in Zo4. Find all cyclic subgroups of 
Ung. 

4. Find an example of a non-cyclic group, all of whose proper subgroups are cyclic. 

5. Find a collection of distinct subgroups (a1), (a2), ..., (dn) with the property that 
(a1) < (ax) <... < (ad,) with n as large as possible. 

6. Let G be a group and a, b € G. Prove that 


(a) o(a) = o(a™'); 

(b) o(ab) = o(ba); 

(c) o(a) = o(g~!ag), for all g € G; 

(d) If ab = ba, and o(a) and o(b) are relatively prime, then o(ab) = o(a)o(b). 


7. If acyclic group has an element of infinite order, how many elements of finite 
order does it have? 

8. Give an example of a group that has exactly 6 subgroups (including the trivial 
subgroup and the group itself). Generalize to exactly n subgroups for any positive 
integer n. 

9. If G is an abelian group and contains cyclic subgroups of orders 4 and 5, what 
other sizes of cyclic subgroups must G contain? Generalize. 

10. If G is an abelian group and contains a pair of cyclic subgroups of order 2, show 
that G must contain a subgroup of order 4. Must this subgroup be cyclic? 

11. Let G be an abelian group. Show that the elements of finite order in G form a 
subgroup. This subgroup is called the torsion subgroup of G. 

12. Find the torsion subgroup of the multiplicative group R* of non-zero real num- 
bers. 

13. Find the torsion subgroup T of the multiplicative group C* of non-zero complex 
numbers. 

14. Let G be an abelian group of order mn such that m and n are relatively prime. If 
there exists a, b € G such that o(a) = m and o(b) = n, prove that G is cyclic. 

15. Show that both U5 and U27 are cyclic groups. 

16. Prove that Uz. (n > 3) is not cyclic. 

17. Let a and b be elements of a group. If o(a) = m, o(b) =n, and m and n are 
relatively prime, show that (a) M (b) = {e}. 
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18. Suppose that o(x) = n. Find a necessary and sufficient condition on r and s such 
that (x") < (x*). 

19. Suppose that o(x) = n. Show that (x”) = (x°) if and only if (n,r) = (7, s). 

20. Suppose that a is a group element such that o(a78) = 10 and o(a”*) = 20. Deter- 
mine o(a). 

21. Determine the Hasse diagram for subgroups of Z,2,, where p and q are distinct. 

22. Give an example of a group and elements a, b € G such that o(a) and o(b) are 
finite but o(ab) is not of finite order. 


4.3 Generating Sets 


We are interested in answering the following questions: What is the smallest subgroup 
of a group G containing elements x, ..., x, € G? How can we describe an arbitrary 
element in this subgroup? Or, more generally, what is the smallest subgroup of a group 
G containing a subset A of G and how can we describe an arbitrary element in this 
subgroup? 


Definition 4.38 Let G be a group and A be a subset of G. Let {H; | i € I} be the 
family of all subgroups of G which contain A. Then 


0) Ai 


ie] 
is called the subgroup of G generated by the set A and denoted by (A). 


Indeed, (A) is the smallest subgroup of G that contains A. The elements of A are 
the generators of the subgroup (A), which may also be generated by other subsets. 
If A = {x,,..., xX,}, then we write (x,,...,X,) instead of (A). If G = (x1,..., Xn), 
then G is said to be finitely generated. 


Remark 4.39 If A = {a}, then G = (a) is acyclic group. 
Theorem 4.40 /f G is a group and A is a non-empty subset of G, then 
(A) 2 ee |x; € A, a €Z, 1<i<n, ne N}. 
Proof Let 
H = {xy ...x% |x, € A, a EZ, 1<i<n, neN}. 
Clearly, H isanon-empty subset of G. Suppose that x and y are two arbitrary elements 


of H. Then, there exist x1,...,%n, V1,---,¥m € A anday,..., Qn, Bi,---,Bm € Z 


such that 


_ Qn _ Bi Bn 
X= x,'...x9" and y= yy... Ym". 
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Hence, we obtain 
1 oe On 7 Pm . —Bi cH. 


XY =X, ...X, 

Thus, H is a subgroup of G. Moreover, A C H. Consequently, by the definition of 
(A), we conclude that (A) C H. 

On the other hand, since (A) is a subgroup of G, it follows that all elements of the 

form x}! ...x% (for x; € A and a; € Z) belong to (A). This yields that H C (A). 

Therefore, we deduce that (A) = H. a 


Remark 4.41 If G is an additive group and A is a non-empty subset of G, then 
(A) = {ayx, +...Q,xX, |x; € A, a EZ, 1<i<n, ne N}. 


Example 4.42 Every finite group is finitely generated since G = (G). 


Example 4.43 The additive group Q is not finitely generated: a finite set of rational 
numbers has a common denominator, say m, and the subgroup of Q generated by 
these rational numbers (their integral multiples and sums thereof) will only give 
rise to rational mas with denominators dividing m. Not all rationales have such 
denominators (try | Saas so Q does not have a finite set of generators as an additive 


group. 


Example 4.44 A finitely generated group is at most countable, so an uncountable 
group is not finitely generated. For example, the group of real numbers under addition 
is not finitely generated. 


If G is a group and { H; |i € J} is a family of subgroups of G, then U H; is nota 
subgroup of G in general. The subgroup ( U H;) generated by the set U 1 is called 


the subgroup generated by groups { H; |i é 7 }. If H and K are auheeoups of G, then 
the subgroup (H U K) generated by H and K is called the join of H and K and is 
denote by Hv K. 


Theorem 4.45 /fG isa group, A is a subgroup of Z(G), the center of G, andz € G, 
then (A U {z}), the subgroup generated by A and z, is abelian. 


Proof Let H = (A U {z}) and y be an arbitrary element of H. First, we show that y = 
az‘, for some a € A and integer k. By Theorem 4.40, we can write y = ajd2...dn, 
where for each 1 <i <n, a; € A, a; =z or a4; = z~!. Since A is a subgroup of 
Z(G), it follows that a), a2, ..., @, commute. We set all a;s that belong to A on the 
left side and hence other elements will be z or z~!. If a is the product of a;s that are 
in A, then y = az* for some integer k. 

Now, suppose that x and y are two arbitrary elements of H. Then, we have 


x =a,z" and y= az, 


for some a), a2 € A and integers k;, kz. Thus, we obtain 
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xy = (ayz")agz 
ap (az*')z* 
= ana, zzk 
= ayzka, zhi 
yx. 


This yields that H is an abelian subgroup of H. a 


Exercises 


1. In (Z, +), determine the subgroup generated by 


(a) {4, 6}; 
(b) {4, 5}. 


2. Let G be a group and X be a non-empty subset of G. For every a € G, prove that 
(a“!Xa) =a7"(X)a. 

3. Prove that group generated by G \ H equals G, where H is a proper subgroup of 
G. 

4. Let G be an abelian group and A = {aj,..., a,} be a finite subset of G. If the 
order of each element of A is finite, prove that (A) is a finite subgroup of G. 

5. Suppose that G is a group containing subgroups H and K. Show that HU K = 
HY K ifandonlyif HK e€ {H, K}. 

6. Show that there is a group G containing subgroups H, K, L such that H U K U 
L=HVKVLbutHOKNOL€{H, K, L}. 


4.4 Worked-Out Problems 


Problem 4.46 Let G be an abelian group and x, y € G be of finite orders. 


(1) Show that o(xy)|[o(x), o(y)], where [o(x), o(y)] stands for least common mul- 
tiple; 
(2) Give an example to illustrate that o(xy) 4 [o(x), o(y)], in general. 


Solution (1) Assume that x and y are two elements of orders m and n, respectively. 
Let [m,n] = r. Then, we have r = mm’ = nn’, and hence 

x"y" = (xy =e. 
Now, suppose that o(xy) =k, then k is the smallest positive integer k such that 


(xy)* = e. Since G is abelian, it follows that 
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(xy) =x" y" = e and (xy)* = x" y* =e. 


This shows that (xy)* = (xy)’ =e. Since o(xy) = k, we conclude that k|r. This 
completes the proof. 

(2) For this part, let x be a non-identity element of G and let y = x~!. Then, it is 
clear that 1 = o(e) = o(xx7!) = o(xy) F [o(x), o(y)]. a 


Problem 4.47 Prove that a group is finite if and only if it has only finitely many 
subgroups. 


Solution Suppose that G is a finite group of order n. Since the number of subsets of 
G is 2”, it follows that the number of subgroups of G is less than 2”. 

Conversely, suppose that the number of subgroups of G is finite. Let a € G be 
an arbitrary element. We claim that (a) is finite. If (a) is an infinite group, then the 
order of a is infinite. Now, for each positive integer k, let Hy, = (a*). We prove Hys 
are distinct. Indeed, if H; = H;, for some positive integers m and n, then ai € (a!) 
and a/ ¢€ (a'). So, there exist integers r and s such that a! = a/" and a/ = a’. This 
yields that a'~/” = e and a/~"* = e. Since the order of a is infinite, it follows that 
i— jr =O and j —is =0, or i|j and j|i. Hence, i = 7. Consequently, H;,s are 
distinct. This means that G has many infinitely subgroups, and it is a contradiction. 
Therefore, (a) is finite, for every a € G. Since 


G=U@) 


acG 


and the number of subgroups of G is finite, we conclude that 
G = (a1) U (az) U... U (ay). 


Since the finite union of finite sets is finite, it follows that G is finite. |_| 


4.5 Supplementary Exercises 


1. LetG = {aj, a, ..., a,} bea finite abelian group of order and x = ajd2... ap 
be the product of all the elements in G. Show that x? = e. 
2. Suppose that the elements x, y in the group G satisfy the relation xyx~! = y’, 


where y is a non-identity element. 


(a) Show that x*yx7> = y*?; 
(b) If o(x) =5, compute o(y). 


3. Let G be the collection of all rational numbers x which satisfy 0 < x < 1. Show 
that the binary operation 


x+y ifO<x+y<1 


reya {rT ifx+y>1. 
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10. 


11. 


makes G into an infinite abelian group all of whose elements have finite order. 
Let G be a group and x € G such that o(x) = mn, where m and n are positive 
integers and relatively prime. Show that one can write x = ab, where o(a) = m, 
o(b) = nand ab = ba. Moreover, prove the uniqueness of such a representation. 
Give an example of an infinite group that has exactly two elements of order 4. 
For every integer n greater than 2, prove that the group U,,2_ is not cyclic. 

Let G be an abelian and suppose that G has elements of orders m and n, respec- 
tively. Prove that G has an element whose order is the least common multiple of 
m and n. 

Let G be a finite group. Prove that the following conditions are equivalent: 


(a) G is cyclic and |G| = p”, where p is prime and n is a non-negative integer; 
(b) If A and B are subgroups of G, then A < Bor B <A. 


Prove that the additive group of all rational numbers is a torsion-free group and, 
further, that it can be represented as a union of an ascending chain of cyclic 
subgroups. 

Suppose that G is a finitely generated group and that G = (Y), where Y is not 
necessarily a finite set. Prove that there is a finite subset X of Y such that G = (X). 
Prove that a group G is finitely generated if and only if any increasing sequence 
H, < Hy < ... of subgroups of G stabilizes, i.e., Hj = H; fori, j => k starting 
from some k. 


Chapter 5 M®) 
Permutation Groups cies 


In this chapter, we construct some groups whose elements are called permutations. 
Often, an action produced by a group element can be regarded as a function, and 
the binary operation of the group can be regarded as function composition. The 
symmetric group on a set is the group consisting of all bijections from the set to itself 
with function composition as the group operation. These groups will provide us with 
examples of finite non-abelian groups. 


5.1 Inverse Functions and Permutations 


In this section, we study certain groups of functions called permutation groups. 


Theorem 5.1 If f : X —~ Y,g:Y — Zandh: Z => W are functions, then their 
compositions are associative, i.e, (ho g)o f =ho(go f). 


Proof It is straightforward. | 


Definition 5.2 For any non-empty set X, the identity function is the function idy : 
X — X defined by idy(x) = x, forall x € X. 


Clearly, if f : X — Y is any function, then f oidy = f andidyo f = f. 


Definition 5.3 Let f : X — Y bea function. We say that f has an inverse function 
if there exists a function g : Y + X such that f og =idy and go f =idy. 


Theorem 5.4 [fa function f : X — Y has an inverse, then this inverse is unique. 


Proof Suppose that g and h are both inverses for f. Then, we have go f=ho f= 
idy and fog = f oh =idy. Thus, we obtain 
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h=hoidy=ho(fog)=(ho f)og=idyog=g. 


This yields that the inverse of f is unique. a 
The inverse of f is denoted by f—!. 
Theorem 5.5 A function f : X — Y has an inverse if and only if f is a bijection. 


Proof Assume that f is a bijection. We define a function g : Y > X as follows: 


s(yy=x > f(x) =y. 


Since f is one to one, it follows that g is a function. Now, by the definition, f o g = 
idy and go f =idy. 

Conversely, suppose that f has an inverse f~!. First, we show that f is one to 
one. If f(x1) = f (x2), then 


SMH =f CM) =f GS) —ide a) Se, 


hence f is one to one. In order to show that f is onto, take any y € Y. Then, 
y =idy(y) = fof (y= fF '0)) = f@), 


where x = f~!(y). So, f is onto. a 


Theorem 5.6 If f : X — Y and g: Y > Z are bijections, then so is the function 
sof. 

Proof Suppose that f and g have inverse functions f—! and g~', respectively. Then, 
we obtain (go f)(f-!og7!)=idz and (f-!og™!)(go f) =idx. 
Hence, the inverse of go f is f-'og~!. Consequently, by Theorem 5.5, go f 
is a bijection. a 


1 


Definition 5.7 Let X be a non-empty set. A bijective function from X to itself is 
called a permutation of X. 


For an arbitrary non-empty set X we define Sx to be the set of all permutations. 


Theorem 5.8 The set Sy of all permutations of X is a group under composition of 
functions. 


Proof We check the group axioms for Sy. By Theorem 5.6, if f, g € Sy, then fog e€ 
Sx. The associativity axioms holds by Theorem 5.1. The identity element is idx. 
Finally, the definition of an inverse function shows that if f~! is the inverse of f, 
then f is the inverse of f~'. Consequently, f~' is a bijection. a 


Since the composition of functions is not commutative, it follows that Sy is not 
abelian, for |X| > 3. 

Let us make a small example to understand better the connection between the 
intuition and the formal definition. 
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Example 5.9 Let X = {O, Hf, A}. Then, the permutations that belong to Sx are: 


@o-0@ @®—-H 
idx : | es | fi: | so) 
A-A A-A 
O-8 @®-A 
fo: HOA fs: | es | 
A>®@ A-@® 
@-0 @-A 
fa: HSA fs: | 9) 
A-8 A—-H 


Exercises 


—_ 


N 


. Define f : N > Nby f() =2, f(2) =3, £3) =4, f(4 =7, f(7) = 1, and 


f(n) =n for any othern € N. Show that f o fo fo f o f =idy. Whatis f—! 
in this case? 


. Let f :Z— Z bea function. For each of the following cases, find a left and a 


right inverses if exist. 


x if x is even 

2x +1 if x is odd 
x/3 if x = 0(mod 3) 
x +1 otherwise. 


(a) f= | 
(b) fs) =| 


. Let f :Z— Z be defined by f(x) = ax + b, where aandb are integers. Find 


the necessary and sufficient conditions on a and b such that f o f = idz. 


. Let f : X — X bea function such that f(f(x)) = x, for all x € X. Prove that 


f is asymmetric relation on X. 


. Afunction f : X — Y is said to be left cancellable if for any set Z and for any 


mappings g and h from Z to X such that f o g = f oh, then g = A. Prove that 
afunction f : X — Y is left cancellable if and only if f is one to one. 


. A function f : X — Y is said to be right cancellable if for any set Z and for 


any mappings g and fh from Y to Z such that go f =ho f, then g = h. Prove 
that a function f : X — Y is right cancellable if and only if f is onto. 


. Given two sets X and Y we declare X < Y (X is smaller than Y) if there is a 


mapping of Y onto X but no mapping of X onto Y. Prove that if X =< Y and 
Y x Z, then X < Z. 


. If X isa finite set and f is a one to one function of X, show that for some positive 


integer n, 
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fofo...of =idx. 
a 


n times 


9. If X has m elements in Exercise 8, find a positive integer n (in terms of m) that 
works simultaneously for all one to one mappings of X into itself. 
10. Ifa e X and H ={f € Sx | f(a) = a}, show that H is a subgroup of Sy. 
11. Let X be an infinite set and let H be the set of all permutations f € Sy such that 
f(a) # a for at most a finite number of a € X. 


(a) Prove that H is a subgroup of Sx; 
(b) Show that if f € Sy, then f-'Hf = H. 


12. If X has three or more elements, show that we can find f, g € Sx such that 
fosF#gof. 

13. Observe that for any positive integer x, we have x = 2” (2n + 1), for some non- 
negative integers m and n. This means that we can define f : N > (NU {0}) x 
(N U {0}) such that f(x) = (m,n), as indicated above. Prove that f is one to 
one and onto. 

14. (Schréder—Bernstein Theorem). Let X and Y be two sets such that 


(a) For a subset A of X, there is a one to one correspondence between A and Y; 
(b) For a subset B of Y, there is a one to one correspondence between B and X. 


Prove that there exists a one to one correspondence between A and Y. 
15. Let G be a group and let a be a fixed element of G. Show that the map f, : G > 
G, given by f,(x) = ax, for x € G, is a permutation of the set G. 


5.2 Symmetric Groups 


In this section we briefly introduce some basic concepts and constructions that we 
will need later. Permutations are usually studied as combinatorial objects, we will 
observe that they have a natural group structure. 


Definition 5.10 The group Sy is called the symmetric group or permutation group 
on the set X. 


The group of permutations of the set X = {1,...,n} is denoted by Sy). 


Theorem 5.11 The order of S, is equal to n\. 


Proof We count how many permutations of {1,2,...,} exist. We have to fill the 
boxes 

12 3 n 
with numbers 1, 2, ..., with no repetitions. For box 1, we have 1 possible choices. 


When one number has been chosen, for box 2, we have n — | choices, and so on. 
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Consequently, we have 
n(n — 1)(n—2)...2-l=n! 


permutations and so the order of S,, is |S,| = n!. a 


We can describe a permutation o € S,, in several ways. A convenient notation for 
specifying a given permutation o € S,, is 


LZ Bg oh 

a, a2 a3... An)’ 
where a, is the image of k under o, for each 0 < k <n. In this case, we write 
ko = a,x. Accordingly, regarding this notation one must be absolutely sure as to 


what convention is being followed in writing the product of two permutations. If 
t,o € S,, then we reiterate that ot will always mean: first apply o and then Tt. 


Example 5.12 Let 


There is another notation commonly used to specify permutations. It is called 
cycle notation. 


Definition 5.13 Let 1 < k <n and let a), a, ..., a, be k disjoint integers between 


1 andn. The cycle (a; az ... ax) denotes the permutation of S,, that sends 


a7 dQ, 
an > 4, 


ag > ak, 
a> a, 


and leaves the remaining n — k numbers fixed. We say that the length of the cycle 
(a, ad... ag) is k. 


It is clear that our choice of starting point for the cycle is not important. Thus, 
(a, az ... Ag) = (& ... ag a). The inverse of a cycle is a cycle. More precisely, 
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Fig. 5.1 An illustration of © G) 
cycle notation 


(ay ay ... ag)! = (ay ag_1 ... ay). 


Example 5.14 As an illustration of cycle notation, let us consider the permutation 
1234567 
5314762)° 

This assignment of values could be presented schematically as in Fig.5.1. 


Example 5.15 The cycle (3 4 1 6) means the permutation where 3 > 4, 4 —> 1, 
1 —> 6, 6 — 3, and all the other elements are fixed. So, (3 4 1 6) € S7 corresponds 


to 
34567 
e419=( 63 24153 ae 
Example 5.16 Suppose that (1 3 4 2) and (2 5 3) are two cycles in S;. Then 


[O34 
G34n@53)={ 55 4 
{23 
214 
=(12)34 


)(3 4 5). 


Y 


Example 5.17 We may write $3, in Example 5.9, as 
S3 = {id, (12), (13), (23), 123), 13 2)}. 


The following is Cayley table for $3. 
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; id doy. 13). 23) 23). (132) 
id id a2). (3). 23) 123) 3:2) 
(12) | (2) id (4:23) 432). 3) -@3) 
(13) | 403) G32) dd -023)° 03) ad 
G3) | 3)" “Aas ao); 7a (12) (13) 
(123)| 123) (23) (12) (13) (132) id 

(1) 3.2)-| (13 By. 3) "By. 29 id. $23) 


Definition 5.18 Two cycles (a; a2 ... ag) and (bj by ... b;) are distinct if 
{a}, do,..., ar} N{by, bo, ..., bj} = GB. 


Lemma 5.19 Ifo = (a, a2 ... ax) andt = (bj by ... by) are distinct, thenot = 
TO. 


Proof Let 1 <i <k—1. Since a; ¢ {b,,..., bj}, it follows that a;t = a;. Hence, 
we get 

aj(TO) = ajo = aj41. 
Also, since aj; ¢ {b,,..., bj}, it follows that 


aj(OT) = Giyi1T = Git. 


Similar arguments for each 1 < j </ show that 


bj(to) => bj410 = Dj41 = bjt — bj(oT) 


and 


a(ot) =ayt =a, =qQyo =a,;(TOo), 
bi(ot) = bt = bi = bho = by (to). 


Finally, if 7 € {a,,...,q,b,..., by}, then 


jo) = (Jno = jo = j = jt = (jo)t = jor). 


Therefore, for each j € {1,...,m} we have j(ot) = j(to). This yields that ot = 
TO. a 


Leto € S,. For each x, y € {1,2,...,}, we define the relation 


x=, y & x =xo* for some integer k. 


Lemma 5.20 The relation =, is an equivalence relation. 


Proof Indeed, we have 


(1) x =, x since x = xo°. 
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(2) Ifx = y, then y = xo*. Hence, x = yo-* a implies that y =, x. 
(3) Ifx =, yand y =, z, then y = xo/ andz = yo* for some integers j, k. Hence, 
we obtain 
z= yok =xolok =xolt, 


This implies that x =, z. 
a 


This equivalence relation induces a decomposition of {1, 2, ..., m} into disjoint 
subsets, namely the equivalence classes. Suppose that m, is the smallest positive 
integer such that xo”* = x. Then, the equivalence class of x under o consists of the 
numbers x, x0, xo07,...,xo0™7!, 


Theorem 5.21 Every permutation in S, can be written as a cycle or as a product of 
disjoint cycles. Up to reordering the factors, this is unique. 


Proof Leto be any permutation in S,,. Then, its cycles are of the form (x xo xo? ... 


xo™—!), Since the cycles of o are disjoint, it follows that the image of a € 
{1,2,...,m} under o is the same as the image of a under the product, 6, of all 
distinct cycles of o. Consequently, o and 6 have the same effect on each element of 
{1,2,...,n}. Therefore, o = 6. In this way, by Lemma 5.19, we observe that every 
permutation can be uniquely expressed as a product of disjoint cycles. | 


This factorization is called the cycle decomposition of o. The cycle structure of 
o is the number of cycles of each length in the cycle decomposition of o. For each 
k =1,...,n assume that m,; denote the number of cycles of length k. Then, we say 
that o has cycle structure 


my, m2 My 
As notation for cycle type, we abbreviate this to 1”', 27”, ..., 7". 


Example 5.22 The permutation 
o = (1 2)356)(48) 


in Sg has cycle structure consisting of one cycle of length 1, two cycles of length 2, 
and one cycle of length 3. 


Theorem 5.23 The number of permutations in S, of cycle structure of the form 
m2”, ...,n"" is equal to 


n! 


mi)...myl 122 2. nn” 
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Proof A permutation of the given cycle structure is produced by filling the integers 
1,2,...,n into the following boxes: 


——$ ———_—<———S 
my, m2 m3 


There exist n! ways of doing this. But some of these ways give the same permutation 
of S,,. We try to count them. 


(1) There exist m,! permutations of cycles of length 1, m2! permutations of cycles of 
length 2, m3! permutations of cycles of length 3, and so on. So, we must divide 
by m,!...my!. 

(2) Each cycle of length 2 can be written in two ways, i.e., (a b) = (ba). Similarly, 
each cycle of length 3 can be written in three ways, i.e., (a bc) = (bc a) = 
(c a b), and so on. So we must divide by 172"? ...n’”". 


This completes the proof. a 
Definition 5.24 A cycle of length 2 is called a transposition. 
Corollary 5.25 Any cycle in S, is a product of transpositions. 


Proof If (a, a2 ... ag) is an arbitrary cycle in S,, then 
(ay dz ... Ax) = (A) a2) (ay 3)... (G1 An), 


as desired. |_| 


Theorem 5.26 Every permutation in S, (n > 2)is either a transposition or a product 
of transpositions. In other words, Sy, is generated by transpositions. 


Proof First, note that the identity permutation can be expressed as (1 2)(1 2), so it 
is product of transpositions. By Theorem 5.21, we know that every permutation is 
a cycle or a product of cycles. By Corollary 5.25, since each cycle is a product of 
transpositions, it follows that S,, is generated by transpositions. a 


Theorem 5.27 The symmetric group S, is generated by n — | transpositions (1 2), 
(1 3), ..., dn). 


Proof By Theorem 5.26, we know that S, is generated by transpositions. Now, if 
(a b) be an arbitrary transposition, then 


(ab) = (la)(1 b)(1a). 


This yields the desired result. | 


Theorem 5.28 The symmetric group S,, is generated by n — | transpositions (1 2), 
(2 3),...(n—I1n). 
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Proof By Theorem 5.26, it suffices to show that each transposition (a b) in S, is 
a product of transpositions of the form (i i + 1), where i < n. Suppose that a < b. 
For the proof we use mathematical induction on b — a that (a b) is a product of 
transpositions (i i + 1). This is obvious when b — a = 1, because (a b) = (aa + 1) 
is one of the transpositions we want in the desired generating set. Now, suppose that 
b—a=k > (and the theorem is true for all transpositions moving a pair of integers 
whose difference is less than k. We have 


(ab) =(aa+1)(a+1b)aa+l). 


The transpositions (a a+ 1) and (a a+ 1) lie in our desired generating set. For 
the transposition (a + 1 b) we have b— (a+ 1) =k—1 <k. So, by assumption, 
(a + | b) is a product of transpositions of the form (i i + 1), so (a b) is as well. Mf 


Lemma 5.29 A cycle of length m has order m. 
Proof It is straightforward. | 


Theorem 5.30 The order of a permutation written in disjoint cycle form is the least 
common multiple of the lengths of the cycles. 


Proof Suppose that o € S, and o = 0102...0,%, where the 0; (( = 1,...,k) are 
disjoint cycles of length m;. Let m be the least common multiple of m1, m2, ..., mx. 
Since m;|m for each 1 <i < k, it follows that 


ao” = (0102...0%)" =o;"05"...0;" = id, 


where id is the identity permutation in S,. Consequently, the order of o is at most 
m, 

Now, suppose that o” = id. This implies that ofo3 ...0; = id. Since o; (i = 
1,..., k) are disjoint, it follows that o/ = id. Since o; is of order m,, it follows that 
m;|r. This yields that m|r. Therefore, we conclude that o is of order m. a 


Example 5.31 We want to determine the number of permutations in S7 of order 3. 
By Theorem 5.30, it is enough to count the number of permutations of the form 


(1) (abc), 
(2) (abc)(x yz). 


For the first case, there exist 7 - 6 - 5 such triples. But this product counts the permu- 
tation (a b c) three times. Thus, the number of permutations of the form (1) is equal 
to 70. 

For the second case, there exist 70 ways to create the first cycle and 432 = 8 to 
create the second cycle. So, we have 70 x 8 = 560 ways. But this product counts 
(a b c)(x y z) and (x y z)(a bc) as distinct while they are equal permutations. 
Consequently, the number of permutations in S7 of the form (2) is 280. 

Therefore, we have 70 + 280 = 350 permutations of order 3 in $7. 
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Lemma 5.32 Let o be any permutation in S, and let 
o = (a, ... aj)(b) ... bj)... (e1 --- Cx) 


be the cycle decomposition of o. Then, for each t € Sn, we have 


clot = (ait ... at)(Qit ... bjt)... (cit ... cet) (5.1) 


which is a product of disjoint cycles. 


Proof We have 
(a\t)t~'ot =ayot = ant, 


(a;_{t)t~!ot = aj_jot =aQ;T, 
(ajt)t~'ot =ajot =ayt, 
(bit)t~'ot = bot = bot, 


(bj-1t)t ot — bj-10T — bjt, 
(bt)t— at = bjot = byt, 


(cjt)t !ot =clot = cot, 


(Ch_-1T)t LOT = Ce_1OT = CT, 
cyt) tot = cpot = CT. 
k k 1 


Moreover, if o do not move integer d, then tot do not move dT. So, we see that 

the right side of (5.1) acts on every integer of {1,..., m} in the same way as t lor. 

This completes the proof. | 
Now we are ready to cut down the size of a generating set for S, to two. 


Theorem 5.33 The symmetric group S, is generated by the transposition (1 2) and 
the cycle (12... n). 


Proof By Theorem 5.28, itis enough to show that the products of (1 2) and (12 ... 1) 
give all transpositions of the form (i i + 1). We may take n > 3. Suppose that tT = 
(12 ... n). Then, by Lemma 5.32, we have 

¢ (U 2r=a 727) = C3), 
and more generally fori = 2,..., — 1, we obtain 


r~"-D qc! = (Ie?! Ar!) = G+ 1). 


This completes the proof. a 
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Theorem 5.34 Two permutations in S, are conjugate if and only if they have the 
same cycle structure up to ordering. 


Proof Suppose that o and 6 are conjugate. Then, there exists t € S, such that 
t lot = 6. Now, by Lemma 5.32, we conclude that o and 6 have the same cycle 
structure. 


Conversely, suppose that 


o = (aq edie a;)(b, i bj)... (e1 ae Ck), 
6 = (ay... a)(Di ... bi)... (ec ian CL) 


be two permutations of S,, with the same cycle structure. Now, we define Tt to be the 
permutation of S, which sends 


a, > aj, ... aj > Gj, 
A i 
bi > bj, ... bj > 5%, 


Cl > Ch, .. Ck > Ch. 


By Lemma 5.32, t~!ot and 6 are the same permutation. a 


Definition 5.35 Two permutations o and t in S, are said to be similar if there exists a 
one to one correspondence between the cycles of o and t such that the corresponding 
cycles have same length. 


Corollary 5.36 Two permutations in S, are similar if and only if they are conjugate. 
Definition 5.37 A group G is centerless if Z(G) = {e}. 


Theorem 5.38 S,, is centerless ifn > 3. 
Z(Sn) = {id}. 


This means that the center of symmetric group is the subgroup comprising only the 
identity permutation. 


Proof Suppose that o is anon-identity permutation in Z(S,,) and leto = 0102... 0%, 
where 0; are distinct cycles with lengths /;s such that], <--- <I, < 1,. Weconsider 
the following two cases: 
Case 1: Let o, = (aj do ... Gm) with m > 3. Since o € Z(S,), it follows that 
oO (a, do) = (a, ay)o. Since o;s are distinct, it follows that oj (a, a2) = (a, az)o}, or 
equivalently 
(ay az... Am) (G1 a2) = (Gy a2 «.. An) (Gi 2). 


This is a contradiction, because in the left side of the equality we have a, > a, 
while in the right side we have a, > ao. 
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Case 2: Let o,; = (a, a2). Since n > 3, it follows that there exists a3 such that 
a3 # a, and a3 A a. Since o € Sy, it follows that o(a; az a3) = (a) ay a3)o, or 
equivalently 


(ay 42)02... OK (Gy Az a3) = (Ay Az 43)(A) A2)02... OK. 


Since a, and a2 do not appear in cycles 02, ... , ox, it follows that in the left side of the 
last equality a; — a3 while in the right side a; — a,. This is again a contradiction. 
Therefore, we conclude that Z(S,,) = {id}. a 


Exercises 
1. Let 
{12345678 wid Pe 12345678 
NB GA 57D. 3 =O G58. 6 SF 
Compute each of the following: 


(a) a7; 
(b) aBae!; 
(c) a3; 
(d) aB~?. 


2. Write each of the following permutations as a product of distinct cycles: 


(a) (345 6)(4 3)C1 2 3); 
(b) (1 2)2 3)(24)(1 35). 


3. Give the Cayley table for the cyclic subgroup of S; generated by 
(12345 
aa é 451 
. Determine eight elements in S¢ that commute with (1 2)(5 6)(3 4). Do they form 
a subgroup of S¢? 
. Find five subgroups of Ss of order 24. 
. Find the number of permutations in the set {o € Ss | 20 = 5}. 
. How many elements of order 6 are there in the symmetric group S11? 
. Find all powers of the cycle 0 = (x; x2 ... Xn). 
. Find all permutations in the symmetric group S,, which commute with the cycle 


(x1 X2 ... Xn), where x; x2 ... X, 1S a permutation of the numbers 1, 2, ..., 7. 
10. Count the number of elements of S, having at least one fixed point. 


aN 


OmMDAINMN 


24. 


25. 


26. 
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. Given the permutations aw = (1 2)(3 4) and 6 = (1 3)(5 6). Find a permutation 


1 


y such that y ‘ay = B. Is y unique? 


. Prove that the permutations 


are conjugate in the symmetric group S¢, and find the number y € S¢ such that 
-1 
yay =p. 


. Show that the permutations in Sy which send the numbers 2, 5, 7 among them- 


selves form a subgroup of So. What is the order of this subgroup? 


. If is at least 3, show that for some f € S,, f cannot be expressed in the form 


f = 8°, for any g € Sp. 


. What is the smallest positive integer n such that S, has an element of order 


greater than 2n? 


. Show that in 57, the equation x* = (1 2 3 4) has no solutions but the equation 


x? = (1234) has at least two. 


. Let X be the set Z3;, andlet f : X — X be the permutation f(x) = 2x. Decom- 


pose this permutation into disjoint cycles. 


. Let X be the set Zy9, and let f : X —> X be the permutation f(x) = x*. Decom- 


pose this permutation into disjoint cycles. 


. Find the cycle decomposition of the permutation induced by the action of com- 


plex conjugation on the set of roots of x> — x + 1. 


. In S4, find the subgroup generated by (1 2 3) and (1 2). Also, for this subgroup, 


find the corresponding subgroup 0! Ho, for o = (1 4). 


. Find necessary and sufficient conditions on the pair i and j in order that 


(12... n), @ f)) = Sh. 


. Show that for all 1 <i <n, we have (23 ... n), (i)) = Sp. 
. Determine a permutation o € S, such that for every 1 <i, j <n, 


i<j >io< jo. 


Find the maximum possible order for a permutation in S, for n = 5, n = 6, 
n=7,n = 10, andn = 15. 

A permutation is called regular if it can be decomposed into disjoint cycles 
of the same length. Prove that every power of a cycle of length n in S,, is a 
regular permutation. Prove that the length of each of the disjoint cycles in this 
decomposition divides n. 

Prove that every regular permutation is a power of some cycle. 
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5.3 Alternating Groups 


The alternating groups are among the most important examples of groups. We study 
some of their properties in this section. 


Theorem 5.39 [fa permutation o can be expressed as a product of even number of 
transpositions, then every decomposition of o into a product of transpositions must 
have an even number of transpositions. In symbols, if 


O=TT%M...T% and o = 6;62...5m 


where the t’s and the 5’s are transpositions, then k and m are both even or both odd. 


Proof We consider a polynomial p of n variable 


P(a,..-,4n) = (a) — a2)(a, — a3)... (A) — Gn) 
(a2 =. a3) (a2 a a4) tee (a2 a an) tee (Gn—1 a4 an) 


= | [@ = aj). 


i<j 
Ifo € S,, we define 


o*(p(a, ies , dy)) = | [@e. _ Aja): 


i<j 
Suppose that t = (7's) is a transposition with r < s. Then, we have 


t*(p(a, ies 0%) = [[@. _ jr). 


i<j 


Note that a,; — ds; = ds — a, = —(a; — as), and if a, and a, do not exist in a factor, 
then this factor is fixed under t. The other factors can be expressed in one of the 
following forms: 


(1) (as — ag) (a; — ag), ifs < k, 


(2) (a, — as)(a, — ay), if r<k <s, 
(3) (a — as) (ax — ay), 1£K <r. 


Therefore, we conclude that t* (pai, sey An)) = —f(a,...,Qn)- 
Ifo = TM... T,%, where 7), ™,..., T; are transpositions, then 
o*(p(a, silly dn) = (tit a t%) (pla, Lats An)) (5.2) 


= (—1)* p(a, sacecy ns 


Similarly, if o = 6,62 ...6,, where 61, 52,...,6,, are transpositions, then 


120 5 Permutation Groups 


o*(p(ai, dea Gn)) a (5152 aod 5m) (p(ai, oer An)) 
= (-1)" pla, .-- ay). me 


Comparing (5.2) and (5.3), we conclude that 
(-Df = (-1)”. 


This implies that these two decompositions of o as the product of transpositions are 
of the same parity. | 


Therefore, any permutation is either the product of an odd number of transpositions 
or the product of an even number of transpositions, and no product of an even number 
of transpositions can be equal to a product of an odd number of transpositions. 


Definition 5.40 A permutation which can be expressed as a product of an even 
number of transpositions is called an even permutation. A permutation which can be 
expressed as a product of an odd number of transpositions is called an odd transpo- 
sition. 


If we define the sign of a permutation o as 


P(a1c; sey anc) 


sgn(o) = 
P(a, eeu ¢ An) 


then 
1 if o iseven 


sen(o) = S if o is odd. 


Ifo, 6 € S,, then 


Qiigdys005 4 
sgn(o) = PC lod nos) 
(a1, ..-,4n) 
= P\Q1o5; + +5 Anos) . P(aic, 22 +3 no) 


DP(ic,---;4no) P(a1,---, Qn) 
= sgn(o) - sgn(6). 


To summarize in words, the sign of the product is the product of sign, 


even x even = odd x odd = even, 
even x odd = odd x even = odd. 


Theorem 5.41 Jf A,, is the set of all even permutations, then A, is a subgroup of Sy. 


Proof If o,6 € Ay, then we have 04 € A,. Since A, is a finite closed subset of the 
finite group S,, it follows that A, is a subgroup of S,. | 


A, 1s called the alternating group of degree n. 


Theorem 5.42 [fn > 1, the order of A, is equal to n!/2. 
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Proof For each even permutation o, the permutation (1 2)o is odd, and if o #4 6, 
then (1 2)o 4 (1 2)6. Hence, there are at least as many odd permutations as there are 
even ones. On the other hand, for each odd permutation o, the permutation (1 2)o 
is even, and if o #4, then (1 2)o ¥ (1 2)5. Hence, there are at least as many even 
permutations as there are odd ones. This yields that there exist equal numbers of 
even and odd permutations. Since |S,| = n!, it follows that |A,| = n!/2. a 


Theorem 5.43 For each n > 3, A, is generated by cycles of length 3. 


Proof Suppose that o € A,. Then, o is a product of even number of transpositions. 
Let a, b, c, and d are four different numbers between 1 and n. Then, we have 


(a b)(ac) = (abc), 
(a b)(cd) =(acb)(cba). 


This completes the proof. | 


Theorem 5.44 For eachn > 3, A, is generated by cycles of the form (1 a b), where 
2<a,b<n,anda#b. 


Proof If o € A,, then o is a product of transpositions. Since (a b) = (1 a)(1 D) 
(1 a) foreach2 < a,b <n,anda & b, it follows that o is a product of transpositions 
of the form (1 a). Since o is even, the number of transpositions of the form (1 a) 
in o is even. But for each a # b, we have (1 a)(1 b) = (1a b). This completes the 
proof. | 


Theorem 5.45 For eachn > 3, A, is generated by cycles of the form (1 2 a), where 
2<a<n. 


Proof If n = 3, then A3 = {id, (1 2 3), (1 3 2)} is generated by (1 2 3). So, we 
assume that n > 4. 

Each cycle of length 3 in A,, containing | and 2 is generated by the cycle of the 
form (1 2 a), because (1 a 2) = (12a)7!. 

For each cycle of length 3 in A, containing | but not 2, we have 


dab)=(12b)12a)12b)1 25D). 
Now, by Theorem 5.44, the proof completes. | 


Theorem 5.46 For each n > 3, A, is generated by consecutive cycles of the form 
(aa+1la+2), wherel1<a<n-—2. 


Proof If n =3, then A3 = ((1 2 3)). If n=4, then by Theorem 5.45, Ay = 
(U1 2 3), (1 2 4)). Since 


(24) =(123)234)0 2 3)d 23), 
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it follows that Ay = ((1 2 3), (23 4)). Now, assume that n > 5. By Theorem 5.45, it 
suffices to show that (1 2 a) can be obtained from a product of consecutive cycles of 
length 3. We apply mathematical induction on a. Let a > 5 and (1 2 b) be a product 
of consecutive cycles of length 3, for 3 < b < a. We have 


(2a)=(2a—-1)02a—-2)(a—2a—-laji2a—1)i2a—2). 


Now, the inductive assumption show that (1 2 a) is a product of consecutive cycles 
of length 3. a 


Theorem 5.47 For eachn > 3, A, is generated by 


(1) (123) and (12 ... n) ifn is odd; 
(2) (123) and (23 ... n) ifn is even. 


Proof Note that if n = 3, then we are done. So, we suppose that n > 4. 
(1) Let n be odd and t = (12 ... nm). Then, we conclude that t € A,. Moreover, 
for each 1 < a < n — 3, by Lemma 5.32, we get 


tT 91 23)t% = (104 20% 3%) = (a+ 1la+2a+3) € Ay. 


Now, by Theorem 5.46, we are done. 
(2) Let n is even and t = (23 ... n). Then, we have t € A,. Also, for each 
1 <a <n -—3, by Lemma 5.32, we obtain 


tT “(1 23)t% = (10% 20% 3t%) = (La+2a+3) € Ay. 
Finally, since (1 a+ 1a+ 2) and (1 aa-+ 1) arein A,, we can write 
(la+la+2)laa+1)=(aa+1a4+2) € Ay. 
Now, by Theorem 5.46, the proof completes. | 


Theorem 5.48 [fn > 5, then all cycles of length 3 are conjugate in Ay. 


Proof Suppose that o and 6 are two cycles of length 3 in A,,. By Theorem 5.34, there 
exists a permutation t € S, such that t-'ot =6. If t € Aj, then we are done. So, 
suppose that t ¢ A,. Leto = (abc). Sincen > 5, it follows that there exist x and y 
notin {a, b, c}. Weset@ = (x y). Since@~'o@ =o, it follows that (@t)~'!o(6t) = 6, 
where Ot € Ay. a 


Theorem 5.49 For each n > 4, the center of Ay is 
Z(An) = {id}. 


This means that A, is centerless, for n > 4. 
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Proof We show that, for every non-identity permutation o, there is a permutation in 
A, that does not commute with o. 

Since o is not the identity, it follows that o maps an element a into b witha # b. 
Since n > 4, we can choose distinct c and d not equal to a and b. Now, we claim 
that the cycle (b c d) does not commute with o. Indeed, o (b c d) maps a into c, but 
(bc d)o maps a into b. Therefore, no o other than the identity commutes with every 
element of A,. In other words, no o other than the identity is in the center of Ay. 
Thus, the only element of the center of A, is the identity. a 


Exercises 


_ 


. Let o and t belong to S,. Prove that o~'t~!or is an even permutation. 
2. Prove that there is no permutation o such that o~'(1 3 4)o = (1 2)(467). 
3. Compute the order of each member of Ay. What arithmetic relationship do these 
orders have with the order of A4? 
4. Prove that As has a subgroup of order 12. 
5. Show that A, has no subgroup of order 6. 
6. Show that the group As contains no elements of order 4, and precisely 15 elements 
of order 2. How many elements of are there of orders 3, 6, respectively? 
7. Show that Ag contains an element of order 15. 
8. Find a cyclic subgroup of Ag that has order 4. 
9. Find a non-cyclic subgroup of Ag that has order 4. 
0. Suppose that H is a subgroup of S,, of odd order. Prove that H is a subgroup of 
An. 
11. Let 7 be an even positive integer. Prove that A, has an element of order greater 
than n if and only ifn > 8. 
12. Let n be an odd positive integer. Prove that A, has an element of order greater 
than 2n if and only ifn > 13. 
13. Let n be an even positive integer. Prove that A, has an element of order greater 
than 2n if and only ifn > 14. 
14. Let H = {o? |o € Sy}and K = {o? |o € Ss}. Prove that H = A,and K = As. 
15. Let H = {o7 | o € So}. Prove that H £ Ao. 
16. Why does the fact that the orders of the elements of A, are 1, 2, and 3 imply that 
|Z(A4)| = 1? 
17. For n > 1, let H be the set of all permutations in S, that can be expressed as a 
product of a multiple of four transpositions. Show that H = As. 
18. Consider S, for a fixed n > 2 and let o be a fixed odd permutation. Show that 
every odd permutation in S, is a product of o and some permutation in A,,. 
19. Show that if o is a cycle of odd length, then o? is a cycle. 
20. Show that every permutation in A, is a product of cycles of length n. 
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5.4 Worked-Out Problems 


Problem 5.50 Show that if n > m, then the number of cycles of length m in S,, is 
given by 
n(n — 1)(n—2)...(n—m+1) 


m 


(5.4) 


Solution We count how many cycles of length m in S, exist. We have to fill the 
boxes: 


1 2 3 m 


with the numbers 1, 2,...,7 with no repetitions. We have n choice for the first 
box. Then, n — 1 choice for the second box, n —2 choice for the third box, 
and so on. Finally, we have n —m-+1 choice for the last box. So, there are 
n(n — 1)(n —2)...(2 —m-+1) choices for a cycle of length m, but we empha- 
sis that some them are the same. For example, the following cycles are the same: 


e Ifm = 2, then (a b) = (b a) (2 equivalent notations); 

e Ifm = 3, then (a bc) = (bc a) = (ca b) (3 equivalent notations); 

e If m=4, then (abc d)=(bcdaj)=(cdab)=(dabc) (& equivalent 
notations). 


In general, by induction we deduce that for cycles of length m, there are m equiv- 
alent notations. Since we have n(n — 1)(n — 2)...(” —m +1) choices to form a 
cycle of length m in which there are m equivalent notations, it follows that the number 
of cycles of length m in S, is equal to (5.4). a 


Problem 5.51 If 7 is at least 4, show that every element of S, can be written as 
a product of two permutations, each of which has order 2. (Experiment first with 
cycles.) 


Solution First, we begin with an example. Let (a; az ... a7) be acycle. We consider 


a = (aj a7) (a2 dg) (a3 a5) 
B = (a2 A7)(43 6) (a4 as). 


Since a and 6 are products of disjoint transpositions, it follows that o(@) = o(B) = 2. 
Moreover, it is easy to see that a6 = (a; dz ... a7). Next, we generalize the above 
example to an arbitrary cycle 0 = (a, ad ... d,). We take 


a= (a, An) (az An—1) tee (a; An—i+1) see (Gin An—m+1), 
B — (az An) (a3 an 1) tee (aj +1 An—i+4 D) tee (Qin-+1 An—m+1)5 


where m = [n/2]. Again, since a and 6 are products of disjoint transpositions, it fol- 
lows that o(a@) = o(6) = 2. Now, we claim that o = af. Since a and £ are products 
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Table 5.1 A short table of values p(n) 
n 1 2 3 4 5 6 7 8 9 10 
p(n) 1 2 3 5 7 11 15 22 30 42 


of disjoint transpositions, what they do to any one a; is determined just by the trans- 
position containing that a;. Hence, fori < m, the transposition (q; a,_;+)) in@ sends 
a; tO d,_;+, and then the transposition (a;41 d,)—j+1) in B sends a,_j;+ to aj41. In view 
of this, ifi < m, then wf sends a; to aj41. Now, ifi > m, then wetake j =n —i+ 1. 
We have j < m andi =n — j +1. So, the transposition (aj d,—j+1) 1s in a and it 
sends a; = d,—j+1 to a; and the transposition (@j—141 @n—(j—1)41) = (@j Gn—j42) in 
B sends a; to a,_j+2. But since j =n —i + 1, it follows thatn —j+2=i+1. 
Therefore, we observe that wf sends a; to aj4; when i > m. Consequently, o = af. 

Finally, suppose that o is an arbitrary permutation. We can write 0 = 0102... 0%, 
where o;s are disjoint cycles. According to the above argument, each of o; can be 
written as the product of two permutations a; and 6;, where o(@;) = o(8;) = 2, and 
a; and B; only permute the numbers appear in o;. Since o;s are disjoint, if 7 A i, 
then a; and £; are disjoint from aw; and 6;. Consequently, «; commutes with a; and 
B;, for alli ~ j. Therefore, we conclude that 


O =0102...0, = AB A2Bo... a, Bx 
= 102...A 6) Bo... Be. 


Since a product of disjoint transpositions has order 2, it follows that 


O(01 02...) = 0(B1B2... Bx). 


This completes the proof. 
a 


Problem 5.52 Let bea positive integer. A sequence of positive integers 11, 72,..., 
ng such thatny > n2 >--- > np andn =n; +n. +---+ nx, 18 called a partition of 
n. Let p(n) denote the number of partitions of n. Table 5.1 is a short table of values 
p(n): Show that the number of conjugate classes in the symmetric group S, is p(n). 


Solution Let o be a permutation in S,,. We can write o as a product of distinct cycles 
as follows: 


(a, a2... ag, (by bo ae by)... (C1 C2! ee Ck;) 


such that kj > ky >--- > kj; and kj + ky +---+k; =n. This is a unique expres- 
sion, and so for each permutation we obtain a unique partition. By Corollary 5.36, 
two permutations are conjugate if and only if they are similar; in other words, they 
give rise to same partition. Hence, corresponding to a conjugate class we get a unique 
partition of n. 
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Conversely, letn, > nz >--- >n, andn =n, +n2+---+7n, bea partition of 
n. Then, there is a permutation t which has a cycle decomposition of the type 


(x1 X2 -.- Xn, )OV1 V2 ++. Vag). +- (Z1 22 +++ 2n,)- 


Each 6 € S, similar to t is conjugate to t, and every permutation in S, conjugate to t 
is similar to t. In this way, for each permutation we can associate a unique conjugate 
class, namely, conjugate class of T. 
Therefore, there exists a one to one correspondence between conjugate classes in 
S, and partitions of n. Consequently, the number of conjugate classes in S, is equal 
to p(n). 
a 


Problem 5.53 Let s(n, k) denote the number of permutations in S, which have 
exactly k cycles (including cycles of length 1). Show that 


s(n, 1) =(n—1)! 
and for k > 2 
s(n,k) =s(n—1,k —1)+(n—1)s(n—1,k). 


Also, prove that 


Yo sn, k)x* =x™ sa x(x4+-1)...~+n—D). 
k=1 


The s(n, k) are known as Stirling numbers of the first kind. The expression x) is 
known as the nth upper factorial. 


Solution Itis easy to see that s(n, 1) = (n — 1)!. In general, we sort the permutations 
in S, with exactly k cycles into two parts, depending on whether the permutation 
contains the cycle (n) of length 1. There exist s(n — 1,k — 1) permutations con- 
taining the cycle (7). The other permutations are formed by inserting n after any of 
the n — 1 elements in the s(n — 1, k) permutations of n — 1 elements into k cycles. 
Consequently, fork > 2, we obtain s(n,k) = s(n—1,k —1)+(— 1)s(n—1,k). 
Now, we can verify the formula for s(n, k) by induction. It is easy to see that the for- 
mula holds for n = 1. Suppose the formula is true for s(m, k), where m < n. Then, 
we have 
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n 


ye s(n, k)x* 


k=1 
= (n — 1)!x +Jo sn Ske 4a 1) Sosa —1,k)x* 
k=2 k=2 
n—-l 
=(n—-1)ix +x Yo s(n —1,k)x* 
k=1 
+(n — 1) (Sos =x =G5 2) 
k=1 
= (n— Ile +x") + (n= 1)(x°") — a — 2)1x) 
= (x +n — 1)x%-) 
= x, 
as desired. |_| 


5.5 Supplementary Exercises 


1. Let G be a group of order 2m, let g € G have order 2, and let A, : G > G be 
defined by g(x) = gx. Show that A, is a product of m disjoint transpositions. 
2. Show that the symmetry group of a rectangle which is not a square has order 4. 
By labeling the vertices, 1, 2, 3, 4 represents the symmetry group as a group of 
permutations of the set {1, 2, 3, 4}. 
. Prove that Sy is abelian if and only if |X| < 2. 
4. Let H be a subgroup of S,,. Show that either H is a subset of A, or exactly half 
of the elements of H are even permutation. 
5. List the elements of the following subgroup in Sq: 


1S) 


(1 4)(2 3), 1 2)G 4)). 


6. Consider the permutation 


123456 7 8 9 10 11 12 13 14 15 16 es 
31694 10 12 13 5 11 15 16 14 2 7 a 
(a) Find its sign and its order. Compute the centralizer o and compute the number 
of elements in this centralizer; 
(b) Write down a! as a product of disjoint cycles. 


7. Let G be a non-abelian group of order 2p for some prime p 4 2. Prove that G 
contains exactly p — 1 elements of order p and it contains exactly p elements 
of order 2. 

8. If p is a prime number, show that in S|, there are (p — 1)! + 1 elements x satis- 
fying x? =e. 
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13. 
14. 


17. 
18. 
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. In the symmetric group Sy, find two elements that neither commute with each 


other and are not conjugate to one another. 


. Let o be the cycle (1 2 ... m). Show that o* is also a cycle of length m if and 


only if k is relatively prime to m. 


. Which permutation of the set X = {x1, x2, x3, x4, x5} leave the polynomial 


xX, + x2 — x3 — x4 invariant? Find a polynomial in these variables which is left 
invariant under all permutations in the group ((x; x2 x3 x4), (x2 x4)) but not by 
all of Sy. 


. Ifo € Az, prove that 


1 
Ca, (o) = Cs,(o) or |Ca,(o)| = 51Cs, (D1. 


Ifo = (12... m) € Sy, show that |Cs (o)| = (n — m)!m. 
Let a(n, m) denote the number of permutations o € S, such thato” = Id (with 
a(0O, m) = 1). Show that 


d 
a(n, m) an = exp ye — 


n! 
d\|m 


Me 


3 
ll 
o 


. Letn > 2 and let A be the set of all permutations in S,, of the form 


on= [|] Gk-a), 


1<i<k/2 


fork = 3, 4, ..., n+ 1. Show that A generates S, and that each o € S, can 
be written as a product of 2n — 3 or fewer elements from A. 


. Let p be a prime congruent to 1 (mod 4), and consider the set 


X = {(x, y,z) € NW | x? + 4yz = ph. 
Show that the function 
(x+2z,z,y—-x-—z) ifx<y-z 


(x,y,z ¥ Qy—x,y,x-yt+z) ify-z<x <2y 
(x —2y,x-—y+z,y) ifx > 2y 


is a permutation of order 2 on X with exactly one fixed point. Conclude that the 
permutation (x, y, z) > (x, z, y) must also have at least one fixed point, and so 
x? + 4y? = p for some positive integers x and y. 

Find the permutation representation of a cyclic group of order n. 

(Stirling Numbers of the Second Kind). In Problem 5.53, we have seen that the 
Stirling numbers s(n, k) of the first kind count the number of ways to partition a 
set of size n into k disjoint non-empty cycles. The Stirling numbers of the second 
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kind, denoted by S(n, k), count the number of ways to partition a set of size n 
into k non-empty disjoint subsets. It is clear that S(n, 1) = 1. 


(a) Find S(n, 2); 
(b) Forn > 0, show that S(n,k) = kS(n — 1,k) + S(n—1,k — 1); 
(c) Show that 


sy She, 


k=1 


19. In the symmetric group S,,, for each k = 3, ..., n, let 


[4] 
on =] [Gk-3), 
i=l 


for example 
o3 = (1 2), 
o4 = (1 3), 
o5 = (1 4)(2 3), 
06 = (15)(2 4), 
07 = (16)(25)G3 4), 
og = (17)(26)G5). 


Show that permutations 01, 02, ..., O», generate S,_1. 
20. An affine geometry comprises a set X whose elements are called points together 


with various subsets of X called Jines such that 


(a) Each pair of distinct points is contained in exactly one line; 

(b) Each pair of distinct lines has at most one point in common; 

(c) Given a line L and a point P not on it, there exists exactly one line L’ which 
contains P and has no point in common with L; 

(d) There are at least two lines. Figure 5.2 gives a pictorial representation of an 
affine geometry with 4 points and 6 lines. 


Fig. 5.2. Affine geometry D 
bd 
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A collineation of an affine geometry is a permutation of the points of X which 
maps lines to lines. Show that the set of all collineations form a group under 
composition. What is the order of the collineation group of the above 4 element 
affine geometry? 


Chapter 6 ®) 
Group of Arithmetical Functions sive 
(Optional) 


In this chapter, we show that the set of all arithmetical functions f with f(1) 40 
forms an abelian group with respect to a special operation. Then, we discuss an 
important subgroup of this group. 


6.1 Arithmetical Functions 


In this section we introduce several mathematical functions which play an important 
role in the study of divisibility properties of integers. We begin with the definition 
of arithmetical functions. 


Definition 6.1 A real or complex-valued function defined on the positive integers 
is called an arithmetical function. 


Example 6.2 Euler function y defined in Definition 4.6 is a arithmetical function. 


Theorem 6.3 (Gauss Theorem) [fn is a positive integer, then 


Y\e@) =n. 


d\n 
Proof Suppose that X = {1, 2, ...,} and let 
S(d) = {k| (k,n) =d, 1 <k <n}. 


Then, the sets S(d) are disjoint and whose union is equal to X. If f (d) is the number 
of integers in S(d), then 


/O=n (6.1) 
d\n 
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Table 6.1 A short table of values pu(1) 
n 1 2 3 4 5 6 7 8 9 10; 11} 12} 13 | 14} 15 


Moreover, we have 
(k,n) =d  (k/d,n/d) =1, 
O<k<n S0<k/d <n/d. 


So, if we take g = k/d, then there exists a one to one correspondence between the 
elements of S(d) and those integers g satisfying 0 < k <n and (q,n/d) = 1. The 
number of such g is equal to y(n/d). Hence, we have f(d) = y(n/d). Now, by 
(6.1), we conclude that 


d\n 


This completes the proof, because when d runs through all divisors of n so 
does n/d. | 


A positive integer n is square-free if p* { n, for every prime p. 
Definition 6.4 The Mobius function py is defined as follows: 
1 ifn=1 


p(n) = 4 0 if n is not square — free 
(—1)* if n is the product of k distinct primes. 


Table 6.1 is a short table of values pu(7): 
Theorem 6.5 [fn is a positive integer, then 
1 1 ifn=1 
dH) = = = lo ifn>1. 
d\n 
Proof Clearly, we have j(1) = 1. Letn > 1 andn = p}''p;”... p;*. We know that 


L(d) is non-zero if d = 1 or those divisors of n which are products of distinct primes. 
Consequently, we can write 


Yo H@ = HA) + wp) + +++ + WPe) + MP pr) +o + HP-1 Pwd) 
d\n 
tes + (Pi p2--- Pk) 


=e (‘) eps (5) Ci es4 (;) (-1)! 


= (+1)? =0. 


6.1 Arithmetical Functions 133 


This completes the proof. a 
The Mobius function is related to the Euler function by the following theorem. 


Theorem 6.6 [fn is a positive integer, then 


ein) = T] wd). 


d\n 


Proof We can write 


n 1 
m= lap): 


k=l 
Then, by Theorem 6.5, we obtain 


ein) =)) DE u@ = DID ue. (6.2) 


= =| din 
k=1 d|(n,k) k=1 ts 


For a fixed divisor d of n we must sum over all those | < k <n which are multiple 
of d. If k = qd, then 


l<k<ne Isqse. 
By using this fact and (6.2), we can write 
fe ae 
y(n) = B 2 He) = ye 2, l= dM 
This completes our proof. a 


Theorem 6.7 [fn is a positive integer, then 


where p is prime. 


Proof Ifn = 1, then there are no primes which divide 1. Hence, the product is empty, 
and so y(1) = 1. Suppose that n > 1 and let pj, po, ..., Pm be the distinct prime 
divisors of n. The product can be written as 
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DOD W 


1 1 1 (-1)” 
=1 Dae 2 oes > eee = 


Pi Pj Pk Pi P2--- Pm 

(6.3) 
On the last line of (6.3), in a term such as )° 1/p; pj; pe, it means that we consider 
all possible products of p; p; px of distinct prime factors of n taken three at a time. 
Moreover, each term on the last line of (6.3) is of the form +1/d, where d is a divisor 
of n which is either | or a product of distinct primes. The numerator +1 is exactly 
L(d). If d is divisible by the square of any prime p;, then j1(d) = 0. So, we observe 
that (6.3) is exactly the same as 


M(-2)-D 4. 


pin 


Therefore, we conclude that 


etn) = wae =n MO =nf] (1-2). 


d\n d\n p\n 
as desired. |_| 


Many properties of y can be easily obtained from Theorem 6.7. Some of these 
are listed in the next theorem. 


Theorem 6.8 Euler function has the following properties: 
(1) y(p®) = p® — p®"|, for each prime number p and a > 1; 
d 
(2) y(mn) = plmp(n)(—), where d = (m,n); 
p(d) 


(3) ylmn) = v(m)p(n), if (m,n) = 1; 

(4) m|n implies p(m)|y(n); 

(5) y(n) is even, for each n > 3. In addition, if n has k distinct odd prime factors, 
then 2* |p(n). 


wm 


Proof (1) It is enough if we consider n = p® in Theorem 6.7. 
(2) By Theorem 6.7, for each positive integer n, we can write 


p(n) 1 
o =TT (1-5): 


pin 
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where p is prime. Each prime divisor of mn is either a prime divisor of m or of n, 
and those primes which divide both m and n also divide (m, n). Therefore, we have 


(2) E(-$) mr 


y(mn) _ (: | _ p\m P p\n _ m n 


oe) Fe 


plom,n) - 


and consequently, we obtain (2). 

(3) It is a special case of (2). In fact, it is enough to consider d = | in (2). 

(4) Since m|n, it follows that there exists an integer 1 < c <n such thatn = cm. 
If c =n, then m = 1 and (4) is obviously satisfied. So, let c < n. By using (2), we 
have 


d 
shegnesomun =a 


: 6.4 
yd) yd) = 


where d = (m, c). Now, we use mathematical induction on n. If n = 1, then (4) is 
true obviously. Assume that (4) is true for each integer less than n. Since c < n and 
d|c, by induction hypothesis, it follows that y(d)|y(c). Therefore, we deduce that 
the right part of (6.4) is an integer multiple of y(m). This yields that y(m)|y(n). 
(5) If m = 2° and a > 2, then by part (1) we obtain y(n) = y(2°) = 2°-!. This 
means that y(7) is even. Now, we assume that n has at least one odd prime factor. 


Then, we have 
vm -aT1 (1) -a11(54 


pin P 


[[@-» 


ae yea a (p — 1). 
[|p nea 


pi\n pin 


Therefore, we conclude that 


y(n) = c(n) | [( - D. 


p\n 


where c() is an integer. Since the product multiplying c(m) is even, it follows 
that y(n) is even. In addition, each odd prime p gives a factor 2 to this product. 
Consequently, if n has k distinct odd prime factors, then 2“ |p(n). | 
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Exercises 
1. Show that ifn — 1 andn + 1 are both primes, with n > 4, then 
(n)<* 
n =. 
ead 
2. Find all n for which y(n) = O(mod 4). 


3. For each of the following statements either give a proof or exhibit a counter 
example. 


(a) If (m,n) = 1, then (y(m), y(n)) = 15 
(b) If is composite, then (n, y(n)) > 1; 
(c) If the same primes divide m and n, then ny(m) = mp(n). 
4. Prove that y(n) > n/6 for all n with at least 8 distinct prime factors. 
5. Prove that 
Y= wd) = p(n). 


d\n 


6.2 Dirichlet Product and Its Properties 


The Dirichlet product is a binary operation defined on arithmetical functions. It 
is commutative, associative, and distributive over addition and has other important 
number-theoretical properties. 


Definition 6.9 Let f and g be two arithmetical functions. We define the Dirichlet 
product of f and g by 
n 
soi = Fre) 
(f * g)(n) X f@e(z 
for all positive integer n. 


Theorem 6.10 Let AF be the set of arithmetical functions f such that f (1) 4 0. 
Then, (AF, *) forms an abelian group. 


Proof Since (f * g)(_) = f(1)g(), it follows that AF is closed under Dirichlet 
product. The commutative property is evident from noting that 


df@s (5) = Vo f@eo. 
d\n ab=n 


In order to prove the associative property, suppose that f, g, and h are any mathemat- 
ical functions. Let A = g * h and consider f « A = f * (g * h). Then, we obtain 
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(f * A)(n) = > f@AQ@) 


ad=n 


= V2 f@ VY sHho) 


ad=n bc=d 


= Yo f@gb)nco). 


abc=n 


Similarly, if we assume that B = f * g and consider B «hh = (f * g) * h, then we 
have 


(B*«h)(n) = Y~ B(d)h(a) 


ad=n 

= )) h(a)B(d) 
ad=n 

= Var@ D> f)g©) 
ad=n bc=d 

= DY f®sg orca). 
abc=n 


So, we conclude that f « (g *h) = (f * g) * h,i.e., Dirichlet product is associative. 


Now, let 
em =ltlaf) teat 
“| | | 0 otherwise 


It is clear that e € AF. Moreover, if f € AF, then we have 


From = rW@e(4)=Fs@ I< = fin), 


d\n d\n 


for each positive integer n. This means that e is the identity element of AF. Finally, 
we must show that for given f € AF, there exists f~! € AF such that f * f~! =e. 

Let f is given. We construct f~! inductively. First, we need (f * f~!)(1) = e(J), 
which occurs if and only if f(1) f~'(1) = 1. Since f(1) 4 0, it follows that f—!(1) 
is uniquely determined. Now, suppose thatn > 1 and f—! determined for eachk <n. 
Then, we have to solve the equation (f * f~!)(n) = e(n), or equivalently 


DFG) Fo =0. 


d\n 


This can be written as 


PDF YF (F) FW =0, 


din 
d<n 
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and so 


f= An ~fG) re. 


din 
d<n 


Since the values f—!(d) are known for all divisors d < n, it follows that there is a 
uniquely determined value to f~'(n). Consequently, we may uniquely determine an 
f—' for each f € AF. a 


Remark 6.11 If we consider the usual product of functions, ie., (f - g)(m) = 
f(n)g(n), then the identity element is J (m) = n, for all positive integer n, because 
I. f = f, for all function f. In the group AF, however, J is certainly not identity 
element. But it has the nice property of transferring each function f into its so-called 
sum-function S;. 


Definition 6.12 We define 
Syn) = >> fd) 
d\n 


to be the sum-function of f € AF. 


Note that S; € AF too. Moreover, it is easy to see that J * f = f* I = Sy, for 
all f € AF. 


Lemma 6.13 The Dirichlet inverse of I is the Mébius function 1. 


Proof According to Theorem 6.5, we have 


Yo ud) = 1). 


d\n 


In the notation of Dirichlet product this becomes ps * J = I * ps = e. Therefore, [ 
and js are inverses of each other. | 


Theorem 6.14 Each arithmetical function f can be expressed in terms of its sum- 


function Sf as 
fin) = Yds; (5). 


d\n 
Proof We have 
pow Sp =x (Le f= (wale f sex f= ff, 
and this completes the proof. a 


Theorem 6.15 (Mébius Inversion Theorem) For two arithmetic functions f and g 
we have the following equivalence: 
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fn) = Yad) & sim = f@u (5): 


d|n d\n 
Proof If f = g * I, then we have 
feu=(eeleu=gxUepl=grxe=se. 
On the other hand, if f * js = g, then we have 
gxl=(fxpxla=feuel=freaf. 


This completes the proof. | 


Exercises 


1. Solve the following equations: 
(a) yp (2*5”) = 80; 
(b) y(n) = 12, 
(c) p(n) = 2n/3; 
(d) p(n) =n/2; 
(e) (yn) = 283°. 


2. If (p,q) = 1, prove that 


Hint: Use the identity 


|=] ee) _ 
=p-l, 
q q 


fork = 1,2,...,q — 1, and show that both sums equal (p — 1)(g — 1)/2. 


6.3 Multiplicative Functions 


In this section, we study arithmetical functions called multiplicative functions. These 
functions have the property that their value at the product of two relatively prime 
integers is equal to the product of the value of the functions at these integers. 
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Definition 6.16 An arithmetical function f is called multiplicative if f is not identi- 
cally zero and if f(mn) = f(m) f(a), whenever (m,n) = 1. A multiplicative func- 
tion f is called completely multiplicative if we also have f (mn) = f(m) f (n), for 
all positive integers m and n. 


Example 6.17 The Euler function ¢ is multiplicative. This is easily seen from part 
(3) of Theorem 6.8. But it is not completely multiplicative, since y(4) = 2 and 
p(2)p(2) = 1. 


Example 6.18 The Mobius function 4 is multiplicative. This is easily seen from 
Definition 6.4. But itis not completely multiplicative, since u(4) = O and (2) u(2) = 
1. 


Example 6.19 The identity element e of the group AF is completely multiplicative. 


Example 6.20 Let f and g be two arithmetical functions. Then, the ordinary product 
fg and the quotient product f/g are defined by 


(fg) = finger), 
—)Ja@= Ltn) whenever g(n) # 0. 
8 g(n) 


If f and g are multiplicative, so are fg and f/g. 
Theorem 6.21 [f f is multiplicative, then f (1) = 1. 


Proof Since (n, 1) = 1, for all positive integer n, it follows that f(n) = f(n) fC). 
Since f is not identically zero, it follows that f(n) 4 0, for some positive integer n. 
So, we get f(1) = 1. | 


The following is a nice characterization of multiplicative functions. 
Theorem 6.22 Let f be an arithmetical function with f (1) = 1. Then, 


(1) f is multiplicative if and only if 


SOP oP VHP OT a FE): 


for all primes p; and all positive integers a;; 
(2) If f is multiplicative, then f is completely multiplicative if and only if 


FP") = f(P)", 
for all primes p and all positive integers a. 


Proof The proof follows easily from the definition and is left as an exercise for the 
reader. a 


The following are further examples of well-known arithmetical functions. 
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Example 6.23 Let n be a positive integer. 
(1) The divisor function T is defined to be the number of positive divisors of n, i.e., 
T(n) = l{d € N| d|n}|; 


(2) The sum of divisors function o is defined to be the sum of all positive divisors 


of n, 1.e., 
a(n) = Sod. 


d\n 


Theorem 6.24 The divisor function o and the sum of divisors function o are multi- 
plicative. Their values at primes powers are given by 


pe —] 


T(p®) =a+ land oa(p®) = i 


Proof In order to prove 7 is multiplicative, assume that n and m are positive integers 
with (m,n) = 1. Note that if d,|m and d2|n, then d,d|mn. Conversely, if d|mn, then 
d = dd such that d,|m and d|n. Therefore, there exists a one to one correspondence 
between the set of divisors of mn and the set 


A= {(d,, dz) | di|m and d3|n}. 


Since 7(mn) is the number of divisors of mn and |A| = T(m)r(n), it follows that 
T(mn) = T(m)T(n). Similarly, we can prove that o is multiplicative. Indeed, we have 


o(mn) = Sod = So didp = Yod Sid =a(m)o(n). 


d\n ay|m d\|m d\n 
dy|n 


Moreover, since the divisors of p® are exactly 1, p,..., p®, it follows that T(p®) = 
a+ 1. Moreover, if we apply the geometric series formula, then we obtain 


o(p*) =1+pt---+ p* = ——_, 


as desired. a 
Theorem 6.25 If f and g are multiplicative, so is their Dirichlet product f * g. 


Proof Leth = f * g and (m,n) = 1. Then, we have 


h(mn) = YY fds (=). 


d\|mn 
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Suppose that d = ab such that a|m and b|n. Then, we obtain 


n(mn) = Y° flab)e (=) = r@se (=) 8 (F) 


a\m a\m 
bin bin 


=D r@s (=) £Oe (2) = hemncn. 
b|n 


a\m 
Thus, / is also multiplicative. | 
Theorem 6.26 If g and f * g are multiplicative, then f is also multiplicative. 


Proof Assume that f is not multiplicative and let h = f x g. Then, there exist pos- 
itive integers m and n with (m,n) = 1 such that f(mn) ~€ f(m) f (n). By the well- 
ordering principle, we choose such a pair m and n for which the product mn is as 
small as possible. We consider the following two cases: 

Case 1: If mn = 1, then f(1) 4 fC) fC), and so f(1) £1. Since h(1) = 
f(g) = fd) £0, it follows that h is not multiplicative, and this is a contra- 
diction. 

Case 2: Ifmn > 1, then f(ab) = f(a) f(b), for all positive integers a and b with 
(a, b) = 1 and ab < mn. Therefore, we have 


h(mn) = > flab)g (=) 


a\m 
d\n 


de flab)g (=) + fomng() 


d F@sFOs (=) e(Z) + fomn) 
bln 
ab<mn 


d flag (=) 3 Fe (F) — fomsr + fomn) 
bin 


a\m 


=h(m)h(n) — fim) f(r) + f (mn). 


Since f(mn) € f(m)f (n), it follows that h(mn) 4 h(m)h(n). This shows that h is 
not multiplicative, and it is a contradiction. | 


Theorem 6.27 If f is multiplicative, so is f~', its Dirichlet inverse. 


Proof The result can be easily obtained from Theorem 6.26. In fact, since both f 
and f—! * f =e are multiplicative, it follows that f—! is multiplicative. a 


Corollary 6.28 The set of multiplicative functions is a subgroup of AF. 
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Proof It is clear by Theorems 6.25 and 6.27. a 


Theorem 6.29 Let f be multiplicative. Then, f is completely multiplicative if and 
only if f~'(n) = p(n) f (n), for each positive integer n. 


Proof Let g(n) = y(n) f (n). If f is completely implicative, then 


(«Nn = Du@s@s (F) = FO) Yo u@ = fen) = etn), 


d\n d\n 


because f(1) = 1 and e(n) = 0, for each n > 1. Hence, we have g = f~!. 
Conversely, suppose that f~'(n) = p(n) f (n). In order to prove that f is com- 

pletely multiplicative, it is enough to show that f(p°) = f(p)* for prime powers. 

Since f * f-!' = f x uf =e, it follows that (f * uf)(n) = e(n). This yields that 


dima fas (5) =o. 
d\n 


for each integer n. Now, we take n = p®, then obtain 


wD SDS (p%) + wp) f (p) f(p*") = 0. 


Therefore, we conclude that f(p*) = f(p)f(p°"!). This shows that f(p°) = 
f(p)*. a 


Exercises 


1. Prove that 
plnjo(n) + 1 


n 


is an integer ifn is prime, and it is not integer if n is divisible by square of a prime. 
2. Prove that f is multiplicative if and only if its sum-function $+ is multiplicative. 
3. Let f(n) = [/n] — [Vn — 1]. Prove that f is multiplicative but not completely 
multiplicative. 
4. Solve the equation y(o(2”)) = 2”. 


6.4 Worked-Out Problems 


Problem 6.30 If y(m)|m — 1, prove that there does not exist prime number p such 
that p*|m. 
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Solution Suppose that there exists prime number p; such that p;\m. Let m= 


y Qj 7 
Py'.-. pj’ ++. Py’, where a; > 2. Then, we have 


-1 aj—2 


é is ~1 , 
pi|p; (p;’ | - Py) = py! — P| = eP)).- 


aj 


On the other hand, since p;’|m, it follows that y(p;')|y~(m). So, we conclude that 


pj\y(m). Since p; {m — 1, it follows that y(m) { m — 1. This is a contradiction. 


Problem 6.31 If m is not prime and y(m)|m — 1, prove that m has at least three 
distinct prime factors. 


Solution Suppose that m has exactly two distinct prime factors. By Problem 6.30, 
since y(m)|m — 1, we conclude that m = pq, where p and q are distinct primes. 
Then, we have 
m—1 _ pq-1 
em) (p—D@-1 
_ Pa-~p-aqtit+ptq-2 


(= Dig =1) 
_ @-)a@—-Y p=1 ie q-1 
(P-l@-) @®-VYa@-) @W-V@¢-D) 
ie) ae 
q-1 p-\l 


So, we must have 


a, ST or, 
q-1 p-tl 


This implies that p = g = 2 or p = q = 3. But this contradicts our assumption that 
p and q are distinct. Consequently, it is impossible for m to have only two distinct 
prime factors. | 


Problem 6.32 Show that if m > 2, then the sum of all positive integers which are 
less than m and relatively prime to m is (1/2)my(m). 


Solution Suppose that 1 < k < m such that (k,m) = 1. If (nm —k,m) =d, then 
d|m and d|m — k. This implies that d|k, and so d = 1. Since k < m, it follows that 
m—k > 1. Also, we have m — k < m. Now, assume that 


A={k|1<k<m, (k,m) =}, 
B={m—k|1<k<m, (k,m) =}. 
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There is a one to one correspondence between A and B given by f(k) =m —k. 
Consequently, we have |A| = |B| = y(m). Therefore, we get 


(ki + (im — ki) +22 + (Room + (m= kom) = 2 D5 k. 
O<k<m 
(k,m)=1 


This shows that 
m+---+m=2S, 
eee 


y(m) times 
where S is the required sum. Therefore, we deduce that S = (1/2)my(m). | 


Problem 6.33 Let f : N — N be multiplicative and strictly increasing. If f(2) = 2, 
prove that f(n) = n for all n. 


Solution We have f(1) = 1 and f(2) = 2. First, we show that f(3) = 3. Sup- 
pose that f(3) =k +3, where k is a non-negative integer. We can write f(6) = 
fQFC) = 2(k + 3) = 2k + 6. Since f is strictly increasing, it follows that f(5) < 
2k +5,andso f(10) = f(2) f (5) < 4k + 10. Hence, we must have f (9) < 4k + 9, 
which implies that f(18) = f(2) f(Q) < 8k + 18. This gives 


fd5) < 8k + 15. (6.5) 
On the other hand, we have 
fds) = fAfGS) = kK+3)K+ 5) =k? +8k +15. (6.6) 


By (6.5) and (6.6) we conclude that k = 0. Hence, f(3) = 3, as desired. Now, we 
use mathematical induction. Suppose that 


f(m) =m, for allm = 1,2,...,2k—1, 
where k > 2. Then, we have 


f 4k — 2) = f(2@k-1)) = os —1) 
= 2f (2k —1) = 4k -2 


Since f is strictly increasing, it follows that f(m) = m, forall2k — 1 <m < 4k —2. 
In particular, we deduce that f(2k) = 2k and f(2k + 1) = 2k + 1. This completes 
the proof. 


Problem 6.34 Prove that 


9 ped) 


30 — eld) 
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Solution Since s and yy are multiplicative and y(n) ¥ 0, for each positive integer n, 
it follows that j.7/y is multiplicative. So, if G = S 2/, Sum-function, then 


ye (d) 


G(n) = : 
a 8D) 


and G is multiplicative, since (u?/p) * I = G. Now, ifn = p}''... pe, then 


G(n) = G(py')... G(py") 


1 1 
=(1 (1 
( + a) ( +o) 


_ Pl Pk 
a-1' p-1 
n 
1 1 
n{l—-—]...f{1-— 
P\ Pk 
 ® 
p(n)” 
This completes the proof. | 


Problem 6.35 Let G bea finite abelian group. We say that f :: G > Cisacharacter 
on G if (1) f(x) £0, for all x € G, and (2) f(xy) = f(x) fQ), for all x, y € G. 
Suppose that f is a character on the multiplicative group U;,. We may extend the 
domain of this character to the entire set of natural numbers in the following manner: 
First, let 7 € Ux be the equivalence class modulo k containing n. We extend the 
domain of f to the entire set of natural numbers as follows: 


_{f@ if. =1 
ps {i otherwise. 


We call y a Dirichlet character modulo k. It is easy to see that y has the following 
two important properties: 


(1) It is k-periodic, i.e., y(a + k) = x(a), for all a € N; 
(2) Itis completely multiplicative. 


Table 6.2 shows the Dirichlet characters modulo 5. It is not difficult to check that no 
more exist. 

If y is an arithmetical function that is both periodic and completely multiplicative, 
and it is not zero function, prove that it is a Dirichlet character. 


Solution Let k be the minimal period of x. Since xy is k-periodic, it is constant in 
each equivalence class modulo k. Moreover, since it is completely multiplicative, its 
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Table 6.2 Dirichlet characters modulo 5 
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x (mod 5) 0 1 2 3 4 
0 1 

x2 0 1 -l -l 1 

x3 0 1 i -i -l 

x4 0 1 -i i -l 


values on U;, make it a character. Therefore, we need to show that y(n) = 0 if and 
only if (n, k) # 1. First, assume (n, k) = 1. Then we have n?“ = 1(mod k). Since 
x is completely multiplicative and k-periodic, we have x(n)° = x (n?) = x(1). 
Thus, if v(m) = 0, then y(1) = 0. However, if x(1) = 0, then we have x(m) = 
xC)x(m) = 0, for all m. Hence, if (1, k) = 1 and y(n) = 0, then x is zero function. 
On the other hand, we assume that there is n such that (n,k) > 1 and y(n) £0. 
Then there is at least one prime p such that p|k and x(p) 4 0. Consider this p, and 
let m be any natural number. Because y is k-periodic and completely multiplicative, 
we have 


k 
x(m)x(p) = x(mp) = x(mp + k) = x(p)x (m + -) 


Since y(p) 4 0, we must have x(m) = x(m + k/p), for all m. But this means that 
x is k/ p-periodic. This violates the stipulation that k is the minimal period of y. 


6.5 Supplementary Exercises 


1. Let n be an integer with n > 2. Show that y(2” - 1) is divisible by n. 
2. If p is a prime and n is an integer such that | <n < p, prove that 


p-l 
y (x ") = (0(mod p). 
k=0 


3. Let m and n be positive integers. Prove that, for some positive integer a, each of 
pla), p(at+1),..., pla +n) is a multiple of m. 

4. If n is composite, prove that y(n) <n — Jn. 

5. Find all solutions of y(n) = 4, and prove that there are no more. 

6. Find m, n € N such that they have no prime divisors other than 2 and 3, (m,n) = 
18, T(m) = 21, and r(n) = 10. 

7. Determine an arithmetical function f such that 


1 1 n 
=D) wen 


p(n) 
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8. An arithmetical function f is called periodic if there exists a positive integer k 
such that f(n +k) = f(n) for each positive integer N; the integer k is called 
a period for f. Show that if f is completely multiplicative and periodic with 
period k, then the values of f are either 0 or roots of unity. 

9. Let f be a multiplicative function satisfying paaee JS (p”) = 0. Show that im 


f(n) =0. 


10. Prove that the sum-function S'y of a multiplicative function f is given by 


k 
Spm) =] [+ fOr) + FP) + + FRM), 
i=l 


whenever n = py... p;*. 
11. For any real number x > 1, prove that 


Pera 


n<x 


12. Prove that a finite abelian group G of order n has exactly n distinct characters. 
13. For two sequences of complex numbers {do, a1,...,@,,...} and {bo, bj,..., 
b,, ...} show that the following relations are equivalent: 


n n 


dn = So bk for alln & b, = Yiepitra for all n. 
k=0 k=0 


14. Prove that 


Yr = DEF] ane Dow = Dez) 


k=1 


Chapter 7 M®) 
Matrix Groups cies 


Among the most important examples of groups are groups of matrices. In this chapter, 
matrix groups are defined and a number of standard examples are discussed. 
7.1 Introduction to Matrix Groups 


Let A= (ai;) 


(dif) mp? where 


and B = (bij) Pe We recall the product AB is the matrix C = 


mxn nx 
n 
Cij = ) Aix by;. 
k=l 


Theorem 7.1 Matrix multiplication is associative. 

Proof Let A = Ce ane B= (ii) np and C = (Ci) pea be three arbitrary matri- 
ces. Suppose that AB = D = (dis) mxcp and BC = E= (€ij) neq: Moreover, suppose 
that 


(AB)C = DC = X = (jj) and A(BC) = AE =Y = (yj) 


mxq mxq° 


Then, we obtain 


Pp 
xij = ) dic1j 
i=1 
P n 
=> ) AikDEj ) C1 


l=1 \k=1 
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(ai1bu + ai2bo) + +++ + Ginbu) ci 
1 


P 
l= 


P P p 
= aj) ) bycij + ai2 ) bycijy + +++ + Gin ) Dnt Ci; 
[=I [=1 i=1 


ye (> ber) 


k=1 l=1 
n 
= ye Dikekj 
k=1 
Therefore, we deduce that X = Y. |_| 


A matrix A with n rows and n columns is called a square matrix of order n. 
One example of a square matrix is the n x n identity matrix I,. This is the matrix 


x (Si) defined by 
ee lifi=j 


0 if i Fj. 


nxn 


Definition 7.2 Let A be an n x n matrix over F. An n x n matrix B such that 
BA = I, iscalleda left inverse of A. Similarly, ann x n matrix B such that AB = I, 
is called a right inverse of A. If AB = BA = I,, then B is called an inverse of A 
and A is said to be invertible. 


Lemma 7.3 [fa matrix A has a left inverse B and a right inverse C, then B = C. 


Proof Suppose that BA = I, and AC = I,. Then, we obtain 


B= BI, = B(AC) = (BA)C=1,C =C 


as desired. |_| 


Thus, if A has a left and a right inverse, then A is invertible and has a unique 
inverse, which we shall denote by A7!. 


Theorem 7.4 Let A and B be two matrices of the same size over F. 
(1) If A is invertible, so is A! and (Aq!) =A. 
(2) Ifboth A and B are invertible, so is AB and (AB)! = BAT}, 


Proof (1) It is evident from the symmetry of the definition. 
(2) We have 
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(AB)(B-!A7!) = A(BB"')A~! = AI, A7! = AA! =I. 
Similarly, we obtain (B~'!A~!)(AB) = I,. This completes our proof. | 


Theorem 7.5 /f 
GL, (F) = {A € M,., | A is invertible}, 


then GL, (EF) is a group under the multiplication of matrices. 


Proof The proof follows from Theorem 7.1, the definition of identity matrix and 
Theorem 7.4. a 


GL, (EF) is called the general linear group. 


Definition 7.6 The following three operations on a matrix are called elementary row 
operations: 


(1) Multiply a row through by a non-zero constant. 
(2) Interchange two rows. 
(3) Add a multiple of one row to another row. 


Indeed, an elementary row operation is a special function f which associate with 
each matrix A = (a i) a matrix f(A) = (b; i) such that 


mxXn mxn 


(1) bij = aij ifi Z#k, and b,; = Cdkj, 
(2) bj; = a;; if i is different from both k and /, and by; = aj;, bij = ax;, 
(3) bij = aij if i Z#k, and by; = agj + cajj. 


Lemma 7.7 To each elementary row operation f there corresponds an elementary 
row operation g, of the same type as f, such that f(g (A)) a g(f(A)) = A, for each 
A. 


Proof (1) Suppose that f is the operation which multiplies the kth row of a matrix 
by the non-zero scalar c. Let g be the operation which multiplies row k by c7!. 

(2) Suppose that f is the operation which replaces row k by row k plus c times 
row I, where k # /. Let g be the operation which replaces row k by row k plus —c 
times row J. 

(3) If f interchanges rows k and /, let g = f. 

In each of the above cases, we clearly have f(g(A)) = e(f(A)) = A, for each 
A. a 


Ann Xx n matrix is called an elementary matrix if it can be obtained from the 
identity matrix J,, by performing a single elementary row operation. 


Lemma 7.8 [fthe elementary matrix E results from performing a certain elementary 
row operation f on I, and if Aisanm x n matrix, then the product E A is the matrix 
that results this same row elementary operation is performed on A, i.e., f (a) = EA. 
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Proof It is straightforward by considering the three types of elementary row opera- 
tions. a 


Example 7.9 Consider the matrix 


and consider the elementary matrix 
100 
010 
401 


which results from adding 4 times the first row of J; to the third row. The product 
EA is 

15 41 
3 230 
5 1919 5 


Theorem 7.10 Each elementary matrix belongs to GL, (F). 


Proof If Ais ann x n elementary matrix, then A results from performing some row 
operation on /,. Let B be the n x n matrix that results when the inverse operation 
is performed on J/,,. Applying Lemma 7.7 and using the fact that inverse row opera- 
tions cancel the effect of each other, it follows that AB = J,, and BA = I,. So, the 
elementary matrix B is the inverse of A. This yields that A € GL, (F). a 


Definition 7.11 Anm x n matrix R is called row-reduced echelon matrix if 


(1) the first non-zero entry in each non-zero row of R is equal to 1, 

(2) each column of R which contains the leading non-zero entry of some row has 

all its other entries 0, 

every row of R which has all its entries 0 occurs below every row which has a 

non-zero entry, 

(4) ifrows 1,...,7 are the non-zero rows of R, and if the leading non-zero entry of 
row i occurs in column k;,i = 1,...,7r, thenk; < ky) <--- <k,. 


(3 


wm 


Example 7.12 (1) One example of row-reduced echelon matrix is [,. 
(2) The matrix 


3 
2 
0 


a row-reduced echelon matrix, too. 
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Matrices that can be obtained from one another by a finite sequence of elementary 
row operations are said to be row equivalent. We left to the readers to show that each 
matrix is row equivalent to a unique row-reduced echelon matrix. 

If one defines an elementary column operation and column equivalent in a manner 
analogous to that of elementary row operation and row equivalent, it is clear that each 
m X n matrix will be column equivalent to a column-reduced echelon matrix. Also, 
each elementary column operation will be of the form A — AE, where E isann x n 
elementary matrix, and so on. 


Definition 7.13 A diagonal matrix is a square matrix in which the entries outside 
main diagonal are all zero. A diagonal matrix whose diagonal elements all contains 
the same scalar c is called a scalar matrix. 


Corollary 7.14 The set of all n x n invertible diagonal matrices is a subgroup of 
GL, (EF). 
Definition 7.15 A matrix A = (ai i) , 18 called 


nx 
(1) an upper triangular if a;; = 0 fori > j. 
(2) a lower triangular if a;; = 0 fori < j. 


Let UT,,(F) be the set of upper triangular matrices such that all entries on the 
diagonal are non-zero. 


Theorem 7.16 UT,,(F) is a subgroup of GL, (F). 


Proof Suppose that A and B are two arbitrary elements of UT,,(F). Let C = 
cra = AB. Ifi > j, then 


ep = Se dinbsj = Oand cj = ajjbii A O, 
k 


which shows that C € UT,,(F). 
Now, we notice that 


(1) The elementary matrix corresponding to multiply the ith row by a non-zero 
constant c is the matrix with ones on the diagonal, expect in the ith row, which 
instead has a c, and zero everywhere else. 

(2) If we consider a replacement that add a multiple of the ith row to jth row, where 
i < j, then this matrix has non-zero entries only along the diagonal and in the 
ijth entry, which is above the diagonal. Therefore, all entries bellow the diagonal 
are zero. 


The above row operations are sufficient to row reduce A to [,. Since A is upper tri- 
angular, it follows that there exists a sequence of row operations of the type described 
in the above that transforms A into [,,. Consequently, there is a sequence of upper tri- 
angular elementary matrices FE), E2,..., Ex such that Ey... EyE,A = In, or equiv- 
alently 
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Av! = Ey... ExE}. (7.1) 


The right side of (7.1) is a product of upper triangular matrices. Hence, the result of 
this product is also an upper triangular matrix. Hence, we conclude that the inverse 
of A and lies in UT,,(F). This completes our proof. | 


Let LT,,(F) be the set of lower triangular matrices such that all entries on the 
diagonal are non-zero. 


Theorem 7.17 LT, (F) is a subgroup of GL, (F). 
Proof The proof is similar to the proof of Theorem 7.16. a 


Theorem 7.18 /f A isn x n matrix, then the following are equivalent: 


(1) A€GL,(), 
(2) A is row equivalent to I, 
(3) A is a product of elementary matrices. 


Proof Suppose that R is a row-reduced echelon matrix which is row equivalent to 
A. Then, by Lemma 7.8, there exist elementary matrices F),..., E% such that 
R=E,...E\A 
Since each £; is invertible, it follows that 
A=E,'...E,'R. 


Since products of invertible matrices are invertible, we conclude that A is invertible 
if and only if R is invertible. Since R is a square row-reduced echelon matrix, it 
follows that R is invertible if and only if each row of R contains non-zero entry, that 
is, if and only if R = [,,. We have now shown that 

A is invertible & R= J, 
and if R = J then A = ze oe EE. This proves that (1), (2) and (3) are equivalent 
statements about A. a 


Corollary 7.19 The general linear group GL, (F) is generated by elementary matri- 
ces. 


Corollary 7.20 Let A be an invertible matrix. If a sequence of elementary row 
operations reduces A to the identity, then that same sequence of operations when 
applied to I, yields A~'. 


Example 7.21 We want to find the inverse of 


11 2 
A=|3 4 5], 
2 6-1 
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if exists. If we are able to convert matrix A to the identity matrix by using elementary 
row operations, i.e., 


[A | Jn] —> Un | Bl, 


then A is invertible and A~! = B. The computation can be carried out as follows: 


112/100 1 1 2; 100 
3 4 5;010;—~)]0 1 -1;-3 10 
2 6-1/001 26-1; 001 
1 1 2) 100 10 3) 4 -1 0 
— }0 1-1/-3 10|;—)]01-1}/-3 10 
0 4 —5)-2 0 1 0 4-5);-2 0O 1 
10 3; 4-1 0 1 0 Oj 34 -13 3 
— |]0 1 -1;-3 1 0;—~—7]0 1 -1)-3 1 0 
0 0-1; 10 -4 1 0 0-1; 10 -4 1 
10 Oj; 34 -13 3 100/34 —13 3 
— |0 1 -l| -3 1 Oj} —~]0 1 0/-135 -1 
0 0 1)-10 4 -1 0 0 1 |-104 -1 
Therefore, we get 
34 -13 3 
A=] -13 5-1 
—10 4-1 


Definition 7.22 The determinant of a matrix A, written det(A), is a certain number 
associated to A. If A = (ai nxn, then determinant of A is defined by the following: 


det(A) = és, sgn(c) I] iio) 
1 


oESy i= 


Theorem 7.23 [fn = 1, then det(A) = ay. [fn > 1, then 


det(A) =) ajAy, i=1,...,7 (7.2) 
j=l 
and , 
det(A) = Yo ajjAij, Fo Ree Pe (7.3) 


i=1 


such that Aj; is the ij-cofactor associated with A, i.e., 
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Ajj = (—1)'*/ det(Mi;), 


where Mj; isn — 1) x (n — 1) matrix obtained from A by removing its ith row and 
jth column. The Mj;’s are called the minors of A. 


Proof The proof follows from the expansion of the right sides of (7.2), (7.3) and 
using the definition of determinant. | 


Formula (7.2) is called the expansion of the determinant through the ith row and 
Formula (7.3) is called the expansion of the determinant through the jth column. An 
alternative notation for the determinant of a matrix A is |A| rather than det(A). 


Example 7.24 Evaluate det(A), where 
23 3 
A=|4 5-1 
12 3 
We have 


2 3 ae 2 
= 2(5-3—(—1)-2) —3(4-3-(-1)-1)+34-2-5-D=4. 


det(A) = 2|> “3-3/1 +3] ; 


Lemma 7.25 Let A be anyn x n matrix. 


(1) If B is the matrix that results when a single row of A is multiplied by a constant 
c, then det(B) = c det(A). 

(2) If B is the matrix that results when two rows of A are interchanged, then det(B) = 
— det(A). 

(3) If B is the matrix that results when a multiple of one row of A is added to another 
row, then det(B) = det(A). 


Proof It is straightforward. | 


Lemma 7.26 If E is ann x n elementary matrix and A is any n x n matrix, then 
det(E A) = det(E) det(A). 


Proof We consider the three different types of elementary row operation. 

(1) Let E be the elementary matrix obtained from /,, by multiplying the entries 
of some row of J, by a non-zero scalar c. Then, we can row reduce E to J, by mul- 
tiplying the same row by i Hence, we obtain det(/,) = 1 det(E), or equivalently 
det(Z) = c. Since EA is the matrix resulting from multiplying the entries of a row 
of A by c, it follows that det(ZA) = cdet(A) = det(E) det(A). 

(2) Let E be the elementary matrix obtained from J, by interchanging two rows 
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of /,,. Then, det(Z) = — det(/,,) = —1. Since EA is the matrix resulting from inter- 
changing the corresponding two rows of A, it follows that det(Z.A) = (—1) det(A), 
and so det(E A) = det(E) det(A). 

(3) Let E be the elementary matrix obtained from J, by adding a multiple of 
one row of J, to another row of J,. Since row operations of this type do not 
change the determinant, it follows that det(Z) = det(/,) = 1. Since EA is the 
result of adding a multiple of a row of A to another row of A, it follows that 
det(E.A) = 1 - det(A) = det(£) det(A). 

So, no matter what type of elementary matrix E is, we have shown that det(Z A) = 
det(E) det(A). Now, we are done. | 


Theorem 7.27 A square matrix A is invertible if and only if det(A) 4 0. 


Proof Suppose that E),..., E, are elementary matrices which place A in reduced 
row echelon matrix R. Then, R = E;,... E, A. By Lemma 7.26, we have det(R) = 
det(E;,) ...det(E;) det(A). Since the determinant of an elementary matrix is non- 
zero, it follows that det(A) and det(R) are either both zero or both non-zero. 

Now, if A is invertible, then the reduced row echelon form of A is J,,. In this case, 
det(R) = det(/,,) = 1 which implies that det(A) 4 0. 

Conversely, if det(A) 4 0, then det(R) 4 0 which implies that R can not have a 
row of all zeros. This implies that R = /,, and so A is invertible. |_| 


Corollary 7.28 We have 
GL, (F) = {A € Mnxn | det(A) F 0}. 


Theorem 7.29 A square matrix A = (Gs ase is invertible if and only if AX = 0, 


1.é., 
Qi, Q\2..- Alyn xX] 0 
a21 A22 ... Ady X2 0 
Gn\ Gn2 --+ Ann Xn 0 


has only the trivial solution. 


Proof Suppose that A is invertible and Xo be any solution to AX = 0. Thus, 
AXo = 0. Multiplying both sides of this equation by A~! gives A~!(AXo) = A7!0, 
or equivalently (A~'A)Xo = 0. This implies that Xo = 0. Thus, AX = 0 has only 
the trivial solution. 

Conversely, if AX = 0 has only the trivial solution, then A must be row equiva- 
lent to a reduced row echelon matrix R with n leading 1s. Hence, R = IJ,,. Now, by 


Theorem 7.18 our proof completes. | 
Definition 7.30 Let A be a square matrix and V, V2,..., V, be the columns of A. 
We say that V;, V2,..., V, are linearly independent if and only if the only solution 


for 
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xyVj + x2Vo +---+24,V, =0 (with x; € F) 


is X) =X, =--- =x, = 0. Note that the equation x, V, + x.V2+---+%x,V, =0 
can be rewrite as AX = 0. 


Corollary 7.31 A square matrix A is invertible if and only if the columns of A are 
linearly independent. 


Proof The proof results from Theorem 7.29 and Definition 7.30. a 


Theorem 7.32 [fF is a finite field with q elements, then 


n-1 


|IGL,(F)| =] [@" - 4"). 


k=0 


Proof We count n x n matrices whose columns are linearly independent. We can 
do this by building up a matrix from scratch. The first column can be anything other 
than the zero column, so there exist gq” — 1 possibilities. The second column must 
be linearly independent from the first column, which is to say that it must not be 
a multiple of the first column. Since there exist g multiples of the first column, it 
follows that there exist g” — q possibilities for the second column. In general, the ith 
column must be linearly independent from the first i — 1 columns, which means that 
it can not be a linear combination of the first i — 1 columns. Since there exist g'~! 
linear combinations of the first i — 1 columns, it follows that there exist g” — q'~! 
possibilities for the ith column. Once we build the entire matrix this way, we know 
that the column are all linearly independent by choice. Moreover, we can build any 
n Xn matrix whose columns are linearly independent in this way. Consequently, 
there exist 


@P=H@ =ag =q") 
matrices. |_| 


Theorem 7.33 The center of GL, (F) is 
Z(GL,(F)) = {al, | a € F*}. 


This means that the center of general linear group is the subgroup comprising scalar 
matrices. 


Proof We suppose that n > 1. Each elements of Z (GL, (F) must commute with 
every elements of GL, (F). In particular must commute with elementary matrices. 
We complete the our proof in four steps. 

Step 1: Each element of Z(GL,, (F) commutes with off-diagonal matrix units. 
Indeed, assume that c € F* and e;;(c) is a matrix with c in ijth entry and zeroes 
elsewhere. Specially, e;;(1) is called the ijth matrix unit. Now, we define E;;(c) = 
I, + e;;(c). Then, we have 


7.1 Introduction to Matrix Groups 159 


(1) Since £;;(—c) is the inverse of E;;(c), it follows that E;;(c) € GL» (F). 
(2) Any matrix that commutes with £;;(1) must commute with e;;(1) 


Step 2: If a matrix commutes with off-diagonal matrix units, then it is a diagonal 
matrix. Indeed, let A be a matrix such that a;; A Oforsomei ¢ j and let B = e;;(1). 
Then, j jth entry of AB is non-zero, while the jth entry of BA is zero. Consequently, 
any matrix commutes with all of off-diagonal matrix units e;;(1) can not have any 
off-diagonal entries. 

Step 3: A permutation matrix is a matrix obtained by permuting the rows of 
I, according to some permutation of the numbers | to n. A permutation matrix is 
invertible. Let A be a diagonal matrix with a;; 4 a;; and let B be a permutation 
matrix obtained by permuting the ith and jth rows of J,,. Then, A does not commute 
with B. Therefore, we conclude that any diagonal matrix that commutes with all 
permutation matrices is scalar. 

Step 4: Combining the first two steps yields that any matrix in Z (GL, (F)) must 
be diagonal, and the third step then yields that it must be scalar. | 


Corollary 7.34 For any n x n matrices A and B, 

AB=I1, & BA=I,. 
Proof It is enough to prove that BA = I, implies that AB = I,. Let BA = IJ,. 
If AX = 0, then B(AX) = BO = 0. Hence, (BA)X = 0, or equivalently J, X = 0. 


This implies that X = 0. Now, by Theorem 7.29, we conclude that A is invertible. 
Since BA = I,, it follows that 


A(BA)A"! = Al, A7! = I,. 
This implies that (AB)AA~! = I,. Thus, we conclude that AB = I. |_| 


Corollary 7.35 Let A and B be twon x n matrices. If AB is invertible, then A and 
B are invertible. 


Proof Suppose that C = B(AB)~! and D = (AB)~'A. Then, we obtain 


AC = A(B(AB)~') = (AB)(AB)"! = Ih, 
DB = ((AB)"'A)B = (AB)"|(AB) = Ih. 


Now, by Corollary 7.34, we deduce that C = A~! and D = B™!. i 
The result in Lemma 7.26 can be generalized to any two n x n matrices. 


Theorem 7.36 Jf A and B are twon x n matrices, then 


det(AB) = det(A) det(B). 
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Proof We consider two cases: 
(1) If A is invertible, then there exist elementary matrices E\,..., E, such that 
A=E,...E,. Then, we have 


det(AB) = det(E,... E,B) 
= det(E;) det(E>... E, B) 


= det(E,) ...det(E,) det(B) 
= det(E, E>) det(E3) ... det(E,) det(B) 


= sttB ase) 
= det(A) det(B). 


(2) If A is not invertible, by Corollary 7.35, AB is not invertible. This yield that 
det(AB) = 0 = det(A) det(B), and so the theorem holds. | 


Corollary 7.37 [If A isn x n invertible matrix, then 


1 


det(A~!) => det(A)’ 


Proof Since AA~! = 1,, it follows that det(AA~!) = det(J,) and so det(A~!) 
det(A) = 1. Since det(A) 4 0, the proof can be completed by dividing through by 
det(A). a 


Theorem 7.38 [f 
SL,(F) = {A € GL, (F) | det(A) = 1}, 


then SL,(F) is a subgroup of GL, (F). 


Proof Let A, B € SL, (F) be arbitrary. Since det(A) = det(B) = 1, by Theorem 
7.36 we conclude that det(A B) = 1. This implies that AB € SL,,(F). Moreover, by 
Corollary 7.37, det(A~!) = 1, and hence A~! € SL, (F). a 


SL, (EF) is called the special linear group. 


Theorem 7.39 The center of SLy(F) is 


Z(SLn@)) = {al, | a” = 1). 
Proof The proof results from the definition of SL,(F) and Theorem 7.33. a 


Theorem 7.40 The special linear group SL2(F) is generated by matrices of the 
form 
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forallr,s € F*. 
Proof Suppose that 
ab 
cd 
is an arbitrary element of SL2(F). We consider the following three cases: 
Case 1: If b £ 0, then 


ab] _ 1 o]f1o5 i 0 
cd} | d-bb' 1], 014] @-pet 1] 


Case 2: If c £ 0, then 


ab] _ [1 (a—Nc"! 10 1 d—1)c"! 
cd| | 0 1 cl 0 1 . 


Case 3: If b= c = 0, then 


eee eter eilenue | | 


and we are done. |_| 


Let e;;(A) is a matrix with . in ijth entry and zeroes elsewhere, and let E;;(A) = 
Tn + eij (A). 


Theorem 7.41 The special linear group SL,(F) is generated by matrices of the 
form E;;(A) withi # j and € F*. 


Proof We apply mathematical induction. If n = 2, then by Theorem 7.40, we are 
done. Suppose that the statement is true for each (n — 1) x (n — 1) matrix. Let A 
be any arbitrary matrix in SL, (IF). Multiplying A by £;;(A) on the left or right is an 
elementary row or column operation: 


aii bie Qin 
E,(A)A = ai, + Aaj cone din + AQ jn 
Gn tee Qnn 


and 
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ai, .-- aj + Aaj «++ Ain 
AEj;j(A) = : 


Gnl --- Anj + Ani »++ Ann 


These are the only operations we may use. Since det(A) = 1, it follows that some 
entry of the first column of A is not 0. If az; A O with k > 1, then 


= 1... 
Eu (- taal i |: (7.4) 
ak ame 


If az1,..., Gn are all 0, then a,; 4 0 and 


Hence, by (7.4) with k = 2, we obtain 


1 ee 
Fy2(1 — ay) E21 (—) A=)... . 
a1 ar 


When we have a matrix with upper left entry 1, multiplying it on the left by E;;(A) 
for i ~ 1 will all \ to the lith entry, hence with a suitable \, we can make the i 1th 
entry of the matrix 0. Consequently, multiplication on the left by suitable matrices 
of the form E;;(A) produces a block matrix 


1 x 

0B 
whose first column is all 0’s except for the upper left entry, which is 1. Multiplying 
this matrix on the right by FE) ;(A) for j A 1 adds to the | jth entry without changing 


a column other than the jth column. With a suitable choice of \ we can make the 
1 jth entry equal to 0, and carrying this out for j = 2,..., leads to a block matrix 


1 0 
0 ak (7.5) 


Since this matrix is in SL,(F), it follows that det(A’) = 1. This implies that 
A’ € SL,_-\(F). By induction hypothesis, A’ is a product of elementary matrices 
Eij(A)a—1)x(n—1)- We say 

A'=E,E)...E,. 


So, we have 
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low|-[oae.c|"loe|loml-loz) 


This yields that A is a product of n x n matrices Ej; (A). a 


Definition 7.42 Let A be an m x n matrix. Then, A’, the transpose of A, is the 
matrix obtained by interchanging the rows and columns of A. Geometrically, A‘ is 
obtained from A by reflecting across the diagonal of A. 

We say A is symmetric if A‘ = A and A is skew-symmetric if A' = —A. 


Assuming that the sizes of the matrices are such that the operations can be per- 
formed, the transpose operation has the following properties: 


(1) (4‘) =A, 

(2) (A+ By) =A'H B', 

(3) (cA)! = cA', where c € F, 
(4) (AB) = BYA’. 


Corollary 7.43 If A is any square matrix, then det(A) = det(A‘). 
Corollary 7.44 Any orthogonal matrix has determinant equal to | or —1. 


Theorem 7.45 [f 
On(F) = {A € GL,(F) | A'A = Th}, 


then O,(F) is a subgroup of GL, (F). 


Proof Let A and B be two arbitrary elements of O,(F). Then, we have A‘A = I, 
and B'B = I,. This implies that 


(AB) (AB) = BA'AB= BL, B= B'B=1]1,. 


So, we conclude that AB € O,,(F). Moreover, we have A‘ = A~!, and so 
(AA! = AAT= fh. 
This implies that A~! € O,(F). a 
O, (EF) is called the orthogonal group. We can define 
SO,(F) = SL,(F)N O,(). 


SO, (EF) is called the special orthogonal group. 
One can gets a whole of examples by fixing Q € M,,(F) and defining 


Go(F) = {A € GL, ) | A'QA = Q}. 
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Theorem 7.46 Go(EF) is a subgroup of GL, (F). 


Proof Suppose that A and B are two arbitrary elements of Gg(F). Then we have 
AB € Go(F). Indeed, 


(AB) Q(AB) = B'A'OAB = B'OB = Q. 


In addition, we have A~' € Gg(F). Indeed, when multiplying the equality 
QO = A‘ OA totheright by A~! and to the left by (A~!)', then we obtain (A~!)'QA7! = 
Q. a 


One important case is when n = 2m and we take Q to be the matrix 


0 —In 
Para 


Then, Gg(F) is denoted by S P2,,(F) and is called symplectic group 


Exercises 


1. If A is a symmetric matrix, is the matrix A~! symmetric? 
2. Show that the non-zero elements of Mat, ,(C) is not group under multiplication. 
3. For what values of a and b the following matrix belong to GL4(R): 


4. The trace of a square matrix A = (ai hes is 


tr(A) = — 
i=l 


Let A and B be two square matrices. 


(a) Prove that tr(AB) = tr(BA); 
(b) If B is invertible, show that the formula tr(B~!AB) = tr(A). 


5. Let A be a2 x 2 matrix over a field F. Prove that det(/2 + A) = 1 + det(A) if 
and only if tr(A) = 0. 

6. Let A be ann x n matrix over a field F. Prove that there are at most n distinct 
scalars c € F such that det(ch, — A) == 0. 


7.1 Introduction to Matrix Groups 165 


7. If A and B are invertible matrices and the involved partitioned products are 
defined, show that 


AB]'_[ 0 (og 
C70.) BaP SRS AGee |: 


8. Show that the non-zero elements of Mat, x, (C) is not group under multiplication. 
9. Let A = (ij) nen be a square matrix with a;; = max{i, j}. Compute det(A). 
10. Let A = (ii) nen be a square matrix with aj; = 1/(i + j). Show that A is invert- 
ible. 
11. Show that the set 


G= cosh x sinh x ixeR}, 
sinh x coshx 


in which sinh and cosh are hyperbolic functions, is a group under multiplication 
of matrices. 

12. Show by example thatif AC = BC, then it does not follow that A = B. However, 
show that if C is invertible the conclusion A = B is valid. 

13. Show that SL2(C) = SP2(C). 

14. Does the set {A € GL2(R) | A* = J} forma subgroup of GL2(R)? 

15. Let A be a2 x 2 matrix over a field F, and suppose that A? = 0. Show that for 
each scalar c that det(cIy — A) = c’. 

16. Determine the one-dimensional Lorentz group. That is, all2 x 2 matrices A such 


that 
1 0 1 O 
t — 
“Lo a}+=[0 2] 
17. Determine GL»(Zp), SL(Z), O7(Z2), SO(Z>), and SP3(Z>). 


: , where ad — bc # 0 and 
a,b, c,d are integers modulo 3. Show that |G| = 48; 

(b) If we modify the example of G in part (a) by insisting that ad — bc = 1, then 
what is |G|? 


19. (a) Let G be the group of all 2 x 2 matrices 


18. (a) Let G be the group of all 2 x 2 matrices 


oe , where a,b,c,d are 
cd 
integers modulo p, p is a prime number, such that ad — bc # 0; indeed, 
G = GL2(Z,). What is |G|? 
(b) Let H be the subgroup of G of part (a) defined by 


{ ¢ ] fer be= 1 


indeed, H = SL>(Z,). What is |H|? 
20. Let 
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lab 
H= 1 1lc]|la,bcEeZ 
001 


(a) Show that H is a subgroup of SL3(Zs3); 

(b) How many elements does H have? 

(c) Find three subgroups of H of order 9. Are these subgroups of order 9 abelian? 
Is H abelian? 


21. Find the centralizer of i in GL2(R). 


22. LetA = ; a and B = = s be two elements of SL2(Q). Show that 
o(A) = 4, 0(B) = 3, but AB has infinite order. 

23. Let A € Mat,.2(C). Show that there is no matrix solution B € Mato,.2(C) 
to AB — BA = Ij. What can you say about the same problem with A, B € 
Matnxn(C)? 

24. Let A € F. Prove that the matrix E;;(A) is of order p if and only if the charac- 
teristic of F is p. 


7.2 More About Vectors in R” 


Vectors are the way we represent points in two, three, or n dimensional space. A 
vector can be considered as an object which has a length and a direction, or it can be 
thought of as a point in space, with coordinates representing that point. In terms of 
coordinates, we use the notation 
*=[3] 
y 


for a column vector. This vector denotes the point in the xy-plane which is x units 
in the horizontal direction and y units in the vertical direction from the origin. In n 
dimensional space, we represent a vector by 


Let X = [x; x2 ... X,]‘ and Y =[y yo ... yp]! be two vectors in IR”. We define 
the inner product (X, Y) by 


(X,Y) = x1y1) + x2y2 + +++ XnYn- 
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The length of X is given by 


|X|] = VX -X = [x2 43 +--+ x2. 
x 


The vector U = Ix] is called the unit vector in the X-direction. The angle 6 between 


two vectors X and Yis calculated by 
(X,Y) = |X| || ¥ll cos, 


where 0 < 0 < 7. Two vector X and Y are said to be orthogonal if (X, Y) = 0. 


Definition 7.47 Non-zero vectors X;, X2,..., X% in R” form an orthogonal set if 
they are orthogonal to each other, i.e., (X;, X;) = 0 for all i A j. An orthonormal 
set is an orthogonal set with the additional property that all vectors are unit, i.e., 
|X; || = 1 for alli =1,...,k. 


Lemma 7.48 Any orthogonal set S is linearly independent. 


Proof Let X,, X2,..., X be distinct vectors in S and 
Y =c,X1 + 02X2. +--+ +X. 


Then, we have 


k 
(Y, Xj) = DE GX, Xj) = ¢j(Xj, Xj) = cj || XP. 


i=1 


Since (X;, X;) 4 0, it follows that 


cj= a 1<j<k. 
I| X jl 
Thus, when Y = 0, each c; = 0. a 
Gram-Schmidt orthogonalization process: Let X,, X2,..., Xx be any indepen- 

dent vectors. Then one may construct orthogonal vectors V;, V2, ..., Vz as follows: 

Y= Xi, 

(X2, Vi) 
Vy, = X27 - ——_ Vi, 
I|Vi ll? 


j-l 
(Xj, Vi) 
Nee 2 yan 
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Fig. 7.1 The projection of 
U onto V 


ProjvyU 


Then, Vj, V2,..., Vi are orthogonal vectors. 
The vector projection of a vector U on a non-zero vector V is the orthogonal 
projection of U on a straight line parallel to V, see Fig. 7.1. 


Lemma 7.49 The vector projection of a vector U on a non-zero vector V is equal 


to 
ProjyU Ye 
ro] = —7U. 
: VI? 


Proof The projection vector of U onto V is a scalar multiple of V, i.e., ProjyU = 
cV. Since the vector U — cV is perpendicular to V, it follows that (V, U —cV) = 0. 
Hence, we have (V, U) — c(V, V) = 0. This implies that 


(VU) 
c= : 
(Va) 
Thus, we conclude that 
. (V, U) 
ProjyU = ——_-V. (7.6) 
(VV) 


When we multiply a vector by a scalar, it does not matter whether we put the scalar 
before or after the vector. So, by (7.6), we get 


pape 
ro — e 
uy (V, V) 


Since (V, U) = V'U and (V, V) = || VI’, it follows that 


ay ee 
rojyv = = 5 
IV? Vi? 


as desired. |_| 


If we define 
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VV' 


IVP 


then the projection formula becomes ProjyU = PU. The matrix P is called the 
projection matrix. So, we can project any vector onto the vector V by multiplying 
by the matrix P. 


Exercises 


1. (a) Suppose that 6 € R. Show that 
(cos 9, sin@), (—sin0@, cos 0) 


and 
(cos @, sin@), (sin@, —cos 0) 


are orthonormal bases of R?: 
(b) Show that each orthonormal basis of IR? is of the form given by one of the 
two possibilities of part (a). 
2. Suppose that X, Y € R”. Prove that (X, Y) = 0 if and only if 


|X|| < |X +aY]), 


forallaeR. 


3. Find vectors X, Y € R? such that X is a scalar multiple of : i Y is orthogonal 


3 


1 1 
5 ]-ama 3 | =X+Y. 


4. Apply Gram-Schmidt orthogonalization process to the following vectors in R?: 


1 8 0 
2° 13 1 | and 0 
0 —6 1 


Nn 


. Find the angle between a diagonal of a cube and one of its edges. 
6. If C = ||A||B + || BI|A, where A, B, and C are all non-zero vectors, show that C 
bisects the angle between A and B. 
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7.3 Rotation Groups 
A rotation matrix describes the rotation of an object. In this section, we derive the 
matrix for a general rotation. 


Theorem 7.50 Any 2 x 2 orthogonal matrix with entries in R and determinant | 
has the form 


Rot(9) = cos @ tak 


sinO  cos@ 


and represents a rotation in IR* by an angle 6 counterclockwise, with center the 


origin. 
ab 
cd 
be an orthogonal matrix with entries in R. Then, we have 
pp [eee ab|_ | 10 
sa=[3 all eal-[orl: 


So, both columns of A are mutually orthogonal unit vectors. The only two unit 


vectors orthogonal to le are = and ‘| . Hence, one of these must be ek 


Proof Suppose that 


Consequently, there exist two possibilities for A: 


a | a | 
or ; 
c oa c —a 
Since a? + b? = 1, it follows that the first matrix has determinant 1 and the second 
matrix has determinant —1. If we consider the first matrix, then there exists a unique 


angle @ such that cos @ = a and sin @ = b. Hence, we denote the resulting matrix A 
by 


sinO  cos@ 


Rot(0) = cos @ ak 


to emphasis that it is a function of 0. Now, we obtain 


x} | xcos@—ysind 
oe) | a ; oo 


The right side is the image of the vector B under a rotation about the origin by 
angle 0. | 
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Fig. 7.2 The x- and y-axes 
and the resulting x’- and 


y’-axes formed by a rotation 
through an angle 0 
Sal -X 
Fig. 7.3. Rotation with y 
0=7/2 
x 


Indeed, according to Theorem 7.50, ifthe x and y axes are rotated through an angle 
0, then every point on the plane may be thought of as having two representations: 


(1) (x, y) on the xy plane with the original x-axis and y-axis. 
(2) (x’, y’) on the new plane defined by the new, rotated axes, called the x’-axis and 
y’-axis, see Fig. 7.2. 


Example 7.51 In particular, the matrix 
0 -1 
1 O 


is used for 0 = 7/2, see Fig. 7.3. 
The matrix 
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Fig. 7.4 Rotation with y 
06=T7 
x 
Fig. 7.5 Rotation with y 
0 = 37/2 
x 


is used for 9 = 7, see Fig. 7.4. 


The matrix 


is used for 0 = 37/2, see Fig.7.5. 


Lemma 7.52 If A is ann x n orthogonal matrix with det(A) = 1, then 


det(A — I,) = (—1)" det(A — In). 
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Fig. 7.6 A rotation in R3 


Proof We have 
det(A — I,,) = det(A’) det(A — [,) 
= det (A‘(A — I,)) 


= det(J, — A’) 
= det ((I, — A)') 
= det(I, — A) 


= (-1)" det(A — I), 
as desired. |_| 


Corollary 7.53 [f A is a3 x 3 orthogonal matrix with det(A) = 1, then there exists 
a non-zero column vector X € R? such that AX = X. 


Proof In Lemma 7.52, letn = 3. Then, we deduce that det(A — /,,) = 0. This yields 
that A — J, is not invertible. So, (A — J,)X = 0 has a non-zero solution. Conse- 
quently, there exists a non-zero column vector X € IR? such that AX = X. a 


Definition 7.54 A rotation in R? is a matrix R(e,, 0) determined by a unit vector 
er € R} and an angle 0. More precisely, R(ep, 9) has the line through ep as the axis 
and the plane perpendicular to the line is rotated by the angle 0 in the counterclockwise 
direction (see Fig. 7.6). 


Corollary 7.53 can be used to show that any 3 x 3 orthogonal matrix with deter- 
minant | represents a rotation in R? about an axis passing through the origin. 


Theorem 7.55 Each rotation matrix in R? lies in SO3(R). 


Proof Suppose that R(er, 9) is a rotation in R°, where ep is the unit vector in the 
direction of the axis of rotation. By Gram-Schmidt orthogonalization process, we 
can find two more vectors. So, we can consider an orthonormal set {V;, V2, V3} of R? 
such that V3 = er. Therefore, the matrix [V; V2 V3] is orthogonal. Now, by referring 
to Theorem 7.50, this rotation described by 
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cos@ —sin@ 0 
R= R(er,9)=} sin@d cosé O 
0 0 1 


Since R’R = J; and det(R) = 1, we conclude that R € SO3(R). a 
Theorem 7.56 Each matrix in SO3(R) is a rotation. 


Proof Suppose that R € $O3(R) is an arbitrary element. By Lemma 7.53, there 
exists a column unit vector V3 such that RV3 = V3. By Gram-Schmidt orthogonal- 
ization process, we can extend this to an orthonormal set {V;, V2, V3}. Then, the 
matrix A = [V, V2 V3] € $O3(R). Hence, we conclude that RA € SO3(R). We can 


write 
RV, = av, + bV>, 
RV> =cV, + dV, 


RV3 = V3. 
Since 
ab 0O 
A'RA=|cd 0] €SO;(R), 
00 1 
it follows that 
=| = a b 
A RA= és | € SO2(R), 


which means that it is a planer rotation matrix Rot(@). Consequently, we have R = 
R(V3, 8). a 


Corollary 7.57 The set of rotation in R> can be identified with SO3(R). 


Exercises 


1. For any rotation matrix R, show that R’ = R7!. 
2. Use standard trigonometric identities to verify that 


Rot(@)Rot(¢) = Rot(6 + 4). 


Deduce that Rot(29) = Rot(@)? and Rot(—@) = Rot(6)~!. 
—2 
3. Use rotation matrix to find the image of the vector 1 | if itis rotated 
2 


(a) 30° about the x-axis; 
(b) —30° about the x-axis; 
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(c) 45° about the y-axis; 
(d) —45° about the y-axis; 
(e) 90° about the z-axis; 
(f) —90° about the z-axis. 


7.4 Reflections in R2 and R°? 


Reflection matrix is the matrix which can be used to make reflection transformation 
of a figure. 


Theorem 7.58 A reflection across the line y = mx is performed by matrix 


cos2@ sin20 
Ref(#) = sin20 —cos A : 


in terms of an angle @ that mirror makes with the positive x-axis (Fig. 7.7). 


Proof The formula for the distance of P(x, y) from the line mx — y = Ois 


mx — y 


d = —., 
V1+m? 
where m = tan 6. Hence, we have 


xtand—y xtand—y : 
= — =x sind — ycos@. 


1 + tan? 6 sec 0 


So, we conclude that 


Fig. 7.7 A reflection across 
the line y = mx 
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Fig. 7.8 Reflection with y 
06=7/2 


x 
x’ =x —2dsiné = x — 2 sin @(x sin — ycos@) 
= x(1 — 2sin? 0) + 2y sin @ cos @ = x cos 26 + y sin 20 
and 
y’ = y+2dcosé = y+ 2cos (x sin 6 — ycos 6) 
= 2x sinO cos 6 + y(1 — 2cos? 6) = x sin26 — ycos 20. 
Consequently, we obtain 
x’ | | cos26 sin20 x 
y’ | | sin26 —cos20 |] y |’ 
and we are done. | 


Example 7.59 In particular, the matrix 


[10] 


is used for 9 = 7/2, see Fig. 7.8. The matrix 


-1 0 
0 -1 
is used for 9 = 7, see Fig. 7.9. 


Corollary 7.60 The set of reflections in R* is exactly the set of elements in O2(R) 
that do not belong to SO2(R). 


Definition 7.61 A reflection in O3(R) is a matrix A whose effect is to send every 
column vector of R? to its mirror image with respect to a plane S containing the 


7.4 Reflections in R? and R3 177 


Fig. 7.9 Reflection with y 
06=T7 


zx 
origin. More precisely, suppose that 
S= {ax +by+cz=0|a,b,c € R} 

is a plane through the origin. We say a matrix A makes a reflection across S if 
(1) AX = X for all vectors X in S, 
(2) AX = —X for all vectors X perpendicular to S. 
If N is chosen to be a unit column vector perpendicular to S, then 

AX = X —2(X,N)N, (7.7) 


for all X € R?. 


According to (7.7), we obtain 


AX = X—2(X,N)N 


= X¥ —2N(X,N) 
= X¥ —2N(N'X) 
= X —2(NN')X 
= (In —2NN')X. 


The matrix (J, — 2NN"‘) is called reflection matrix for the plane S, and is also 
sometimes called a Householder matrix. 


Example 7.62 Let X‘ = [x; x2 x3]. Table 7.1 describes some reflections of the 
column vector X‘ across some planes. 


Example 7.63 We want to compute the reflection of the column vector 
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Table 7.1 Some reflections of the column vector X‘ across some planes 


Operator Equations defining the image | Reflection matrix 

x, =x+0y4+0z 1 0 0 
Reflection across the xy plane | x5 = Ox +y+0z 0 1 0 

x4 = 0x + Oy —z 0 0-1 

xi =x+0y+0z 1 0 O 
Reflection across the xz plane | x5 = Ox —y +0z 0 -1 O 

x4 = 0x + Oy +z 0 oO-!1 

x, =—x+0y +0z -1 00 
Reflection across the yz plane | x5 =Ox+y+0z 0 10 

x4 = 0x + Oy +z 001 


x= 2 


across the plane 2x — y + 3z = 0. The column vector 


V=]|-1l 
3 


is normal to the plane and ||V||? = (V, V) = 2? + (-1)? + 37 = V/14. So, a unit 
normal vector is 


V 1 


Vil 14 


Consequently, the reflection matrix is equal to 


ly — 2NNt = ly 2VV" 
ei 2 
=h-z|-1][2 -13) 

B. Ot 6 

eee 

ee ee 

“Me, ee Se 

~ ae. 

Ut cas ge 
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Theorem 7.64 A reflection across a plane S can be performed by the matrix 


cos 26 sin20 0 
Ref(@) = | sin2@ —cos26 0 
0 0 0 


Proof Let V, be a unit vector in R?, V> be a unit vector orthogonal to Vj, and let 
V3 =U x V. Then, (V;, V2, V3) is a orthogonal triple. Suppose that A = Ref(@) is 
a reflection in R? across the plane S through the origin whose unit normal column 
vector is NV, and orthogonal to V3. We can find an angle @ such that 


N =-—sin OV, + cos 0V3>. 
Using Eq. (7.7), we obtain 


AV, = (1 — 2sin’ 0)V, + 2 sin @ cos 0V>, 
AV = 2sin6 cos 6V, + (1 — 2cos” 8) Vo, 
AV3 = V3. 


That is, 
AV, = cos 26V, + sin 20V3, 
AV> = sin 20V, — cos 26V3, 
AV3 = V3. 


Therefore, we obtain 


cos 20 sin2é 0 
A=Ref(9) =| sin20 —cos26 0], 
0 0 0 


as required. a 


Exercises 


1. Use standard trigonometric identities to verify that 


(a) Ref (@)Ref(¢) = Rot(2(0 — 4)), 
(b) Rot(@)Ref(¢) = Ref(¢ + 50), 
(c) Ref(¢)Rot(9) = Ref(¢ — 5). 
2. Find a matrix which represents a reflection in the line y = 2x. 


3. Prove that each element of O3(R) can be expressed as a product of at most three 
reflections. 
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Fig. 7.10 A translation in y 
R2 


4. Find the matrix which induces reflection with respect to given vector [a b c]‘ in 
IR?. Check your result for the case [1 0 0]. 


7.5 Translation and Scaling Matrices 


Suppose that 7, is the set of all translations of a fixed point (x1, x2,...,X,) in R”. 
If 
a\ 
a2 
A=]. € Th. 
an 
then 
x} x] ay 
x4 x2 ar 
=]. + 
: 
xX, Xn an 


T,, with addition of columns matrices as the composition rule is a group. This group 
is called the translation group. 


Example 7.65 A translation is shown in Fig.7.10 by A = H € th. 
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Example 7.66 The set of all translations parallel to the x-axis, i.e., all translations 


of the form 
a 
{[o] <8} 


Example 7.67 The set of all translations parallel to the y axis, i.e., all translations 


of the form 
0 
[2] #<R} 


A scaling can be represent by a scaling matrix. To scale an object by a vector 
Vo =[c1 co ... Cy] (Cc) = +++ = Cy # 0), each point X would need to be multiplied 
by the following diagonal matrix: 


is a subgroup of 7). 


is a subgroup of 7). 


Cy 0. 0 

0 C2 0 
A= -_ 

0 0 ...c& 


As shown below, the multiplication give the expected result: 


Ci 0... 0 xX) CixX] 

0 (6 ecare 0 X2 C2X2 
AX=]..... (|= 

0 0 Cn Be CaXi 


Remark 7.68 The set of all diagonal matrices with non-zero determinants in 
GL,,(R) forms a subgroup. 


Such a scaling changes the diameter of an object by a factor between the scale 
factors, the area by a factor between the smallest and the largest product of two scale 
factors, and the volume by the product of all three. 

The scaling is uniform if and only if the scaling factors are equal (vj) = --- = vp). 
If all except one of the scale factors are equal to 1, we have directional scaling. 

In the case where vj = --- = v, = c, scaling changes the area of any surface by 
a factor of c? (see Fig. 7.11) and the volume of any solid object by a factor of c? (see 
Fig.7.12). 
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Fig. 7.11 Scaling with c = 0.5 in R? 


I 


Fig. 7.12 Scaling with c = 2 andc = 0.5 in R? 


Exercises 


1. Prove that every positive rigid motion in R* can be obtained by a rotation about 
a l-axis, followed by a translation along / (this type of motion is called screw). 

2. Consider the graph y = e*. Suppose that the graph is dilated from the y-axis by 
a factor of 3, reflected in the x-axis, and then translated 1 unit to the left and 2 
units down. What is the equation of the resulting graph? 

3. Consider the graph y = x”. Suppose the graph is dilated from the x axis by a 
factor of 2, and then translated 3 units to the right. What is the equation of the 
resulting graph? 

4. Suppose the graph of y = f(x) is transformed by a dilation of factor k from the 
x axis, factor h ¢ 0 from the y axis, and then translated c units to the right and d 
units up. Show that the resulting graph has equation 


y=kt(Gx-e) +a 


5. Two geometric shapes A and B are shown in Fig. 7.13. Shape A has one point at 
(3, 4) and shape B has one point at (0, —2). Calculate a chain of matrices that, 
when post-multiplied by the vertices of shape A, will transform all the vertices 
of shape A into the vertices of shape B, i.e., translate and rotate the shape point 
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Fig. 7.13 Two geometric 
shapes A and B 


(3,4) 


(3, 4) to (0, —2). The transformation must also scale the size of shape A by half 
to shape B. 


7.6 Dihedral Groups 


The dihedral groups form an important infinite family of examples of finite groups. 
They arise as groups of symmetries of the regular n-gons, and they play an important 
role in group theory, geometry, and chemistry. 


Definition 7.69 The dihedral group D, is the group of symmetries of a regular 
polygon with n vertices. 


We may consider this polygon as having vertices on the unit circle, with vertices 
labelled 1, 2,..., — 1 starting at (1, 0) and proceeding counterclockwise at angles 
in multiples of 27/n radians. There are two types of symmetries of the regular n-gon, 
each one giving rise to n elements in the group D,: 


(1) Rotations Ro, Ri,..., Rn—1, where R; is rotation of angle 2k7/n. 
(2) Reflections So, S1,..., Sn»—-1, where S; is reflection about the line through the 
origin and making an angle of k7/n with the horizontal axis. 


The group operation is given by composition of symmetries. Similar to permuta- 
tions, if x and y are two elements in D,, then xy = y o x. That means that xy is the 
symmetry obtained by applying first x, followed by y. 

So, according to our discussion in Sects.7.3 and 7.4, the elements of D, can be 
thought 2 x 2 matrices, with group operation corresponding to matrix multiplication. 
Indeed, we have 


184 
Qk : Qk 
cos sin 
n n 
R= j 
. (2k 2k 
sin {| —— cos {| —— 
n n 
(=) : (= ) 
cos {| —— sin | —— 
n n 
Ss. = 
Qkt 2kr 
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Now, it is not difficult to see that the following relations hold in D,: 


RR; = Ri+;, 
RS; = Si+j, 
SR; — Si-j 
5,9) = Ris, 


where 0 <i, j <n—1, and both i+ j and i — j are computed 


modulo n. The 


Cayley table for D,, can be readily computed from the above relations. In particular, 


we see that Ro is the identity, Re = R,_; and s = §;. 


Example 7.70 In Example 3.39, we investigated D3, the symmetry group of the 


equilateral triangle. The matrix representation of D3 is given by 


1 V3 1 V3 
ro=| 4 Ha R=| 2 7 |,m=| 7 * |, 

J3 1 J3 1 

2 2 2 Pi 

1 V3 1 J3 

m=| 9 2 |. s= 2 2 $= 2 2 
= /3 1 Jf3 1 

7 p} 2 2 


Example 7.71 In Example 3.40, we investigated D4, the 
square. The matrix representation of D4 is given by 


1 0 0 -l -1 O 
m=[o af ma[i of fo -} A 
1 0 0 1 -1 0 
%=| 5 5]: =| a =| 0 |. S3 = 


[=o] 
[-1 
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| . , : =, , 
Fig. 7.14 Examples of polygons 


Theorem 7.72 Let R be a counterclockwise rotation of the n-gon by 27/n radians 
and let S be a reflection across a line through a vertex. See examples in the polygons 
in Fig. 7.14. Then 


(1) The n rotations in D, are I, R, R2,..., R"7'; 
(2) o(S) = 2; 
(3) The n reflections in D, are S, RS, R?S,..., R’'S. 


Proof (1) The rotations J, R, R2,..., R"—! are distinct since R has order n. 

(2) Simply consider what applying s twice to each vertex will do to it. 

(3) Since J, R, R?,..., R”~! are distinct, it follows that S, RS, R?S,..., R"-'S 
are distinct. In addition, for each j, R/S is not a rotation because if R/S = R’, then 
S = R'~/, while S is not a rotation. |_| 


Corollary 7.73 The dihedral group D,, has 2n elements and 
Die, Re ip cseg BOS, RE, 4g OO 
In particular, all elements of D, with order greater than 2 are powers of R. 


Corollary 7.74 The dihedral group D,, is generated by R and S. 


Proof Since every element of D,, is a product of R and S, by Theorem 4.40 it follows 
that {R, S} generate D,. |_| 


Theorem 7.75 In the dihedral group Dy, we have RS = SR™!. 


Proof Since RS is areflection, it follows that (RS)* = I, or equivalently RSRS = I. 
This implies that RS = S~'R—!. Since 0(S) = 2, it follows that S = S~! and so we 
obtain RS = SR7!. a 


Corollary 7.76 For each | < j <n, we have R‘S = SR. 
Proof It is straightforward by mathematical induction. a 
Corollary 7.77 The dihedral group D,, is not abelian if n > 3. 


Theorem 7.78 For each n > 3, the center of the dihedral group Dy, is 
(1) Z(D,) = {1} ifn is odd; 
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(2) Z(D,) = U1, R"/?} ifn is even. 


Proof In order to determine the center of D,, it suffices to determine those elements 
which commute with the generators R and S. Since n > 3, it follows that R~! 4 R. 
If R‘S € Z(D,) for some 1 < k <n, then R(R‘S) = (R*‘S)R. Then, we must 
have 
R't+!s = R(R‘S) = (R‘S)R = R*(SR) = R‘(R“'S) = R'S. 


This implies that R? = I acontradiction. Thus, we conclude that no reflections are 
in the center of D,, since reflections do not commute with R. 

Similarly, if for 1 < j <n, R/S = SR/, then R/S = R~/S. Hence, R7/ = I. 
Since R has order n, it follows that 


RI=I © n2j. 
If n is odd, then 
Ri =16 n\j © jis multiple ofn & R/ =1. 


So, if 1 is odd, then the only rotation that could be in Z(D,,) is TI. 
If n is even, then 
RA=1 & n\2j  (n/2)|j. 


Hence, the only choice for j are 7 = 0 and j = (n/2). So, we have 
Ri = R°=1 or R/ = R"”. 


To show that R”/ is in Z(D,), we check it commutes with every rotations and 
reflections in D,. Clearly, R"/2 commutes with all rotations, since all rotations are 
powers of R. Hence, we check R"/? commutes with each reflections. Indeed, we 
have 

R"!?(R! S)= R@/2D+i § (7.8) 


and : ; : 
(R'S)R"/? = Ri(SR"/?) = Ri R25 


= RiR"2S = Rit@/D 5 = R&/2)+i gg, (7.9) 


From (7.8) and (7.9), we obtain R”/?(R'S) = (R'S)R"/. 
Therefore, we conclude that 
{I} if n is odd 


Z(Dn) = Bd : 
(1, R"/*} if nis even. 
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Theorem 7.79 The conjugacy classes of the dihedral group Dy, are as follows: 
(1) Ifn is odd, 


e the identity element: {I}, 
e (n — 1)/2 conjugacy classes of size 2: {R, R7'}, {R2, R~7}, ..., (ROVE, 
R-@-D/Py 


e all the reflections: {R, RS, R?S, ..., R"—'S). 
(2) Ifn is even, 


e two conjugacy classes of size 1: {I}, {R"/*}, 

e (n/2) — 1 conjugacy classes of size 2: {R, R7'}, {R?, R~7}, 20, {RO/-), 
R-@D—Dy 

e the reflection divided into two conjugacy classes: 


{RS | 0 <k < (n/2) — 1} and {R218 | 0 <k < (n/2) — 1}. 


Proof We know that every element of D,, is of the form R! or R‘S for some integer 
i. Thus, in order to determine the conjugacy class of an element X, we compute 


R-'XR and (R'S)7!X(R'S). 


Since R~'R/R' = R/ and (R'S)~!R/(R'S) = R~!, it follows that the only conju- 
gates of R/ in D, are R/ and R~/. It is necessary the more computation to be sure 
nothing more is conjugate as well. Now, we try to find the conjugacy class of S. We 
have 

RSR = RS, 

(R'S)~'S(R'S) = R7‘S. 


Since 1 <i <n, it follows that R7 and R"~~' run through powers of R divisible by 
2 

Let n be odd. Since 2 is invertible modulo n, it follows that we can solve the linear 
congruence equation 2i = k(mod n) for each 7. This yields that 


{R7S|ieZ}={R‘S|k eZ}. 
Hence, every reflections in D, is conjugate to S. 
Now, let n be even. We only obtain half the reflections as conjugates of S. The 


other half are conjugate to RS. Indeed, we have 


R-(RS)Ri = R15, 
(R'S)“!(RS)(R'S) = R7-'S, 


Since 1 <i <n, this gives us {RS, R38, ..., R"—'S}. | 


Example 7.80 An application of D4. One application of Dg is in the design of a 
letter-facing machine. Imagine letters entering a conveyor belt to be postmarked. 
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They are placed on the conveyor belt at random so that two sides are parallel to 
the belt. Suppose that a postmarker can recognize a stamp in the top right corner of 
the envelope, on the side facing up. In Fig. 7.15, a sequence of machines is shown 
that will recognize a stamp on any letter, no matter what position in which the letter 
starts. The letter P stands for a postmarker. The letters R and S stand for rotating 
and flipping machines that perform the motions of R and S. The arrows pointing up 
indicate that if a letter is postmarked, it is taken off the conveyor belt for delivery. 
If a letter reaches the end, it must not have a stamp. Letter-facing machines like this 
have been designed. One economic consideration is that R-machines tend to cost 
more than S-machines. R-machines also tend to damage more letters. Taking these 
facts into consideration, the reader is invited to design a better letter-facing machine. 
Assume that R-machines cost $1000 and S-machines cost $750. Be sure that all 
corners of incoming letters will be examined as they go down the conveyor belt. 


Exercises 


1. Show that D, is non-abelian group of order 8, where each element of Dy is of the 
form a‘'b/,0<i<3and0<j <1. 

2. Draw the Hasse diagram for subgroups of the dihedral group D4. 

3. In the dihedral group D,, suppose that R is a rotation and that S is a reflection. 
Use the fact that RS is also a reflection, together with the fact that reflections have 
order 2, to show that RS'R is the inverse of S. 

4. For each of the snowflakes in Fig. 7.16 find the symmetry group and locate the 

axes of the reflective symmetry. 

Find the symmetry group of the Iranian architecture in Fig. 7.17. 

6. Design a better letter-facing machine. How can you verify that a letter facing 
machine does indeed check every corner of a letter? Can it be done on paper 
without actually sending letters through it? 

7. Let S be areflection in the dihedral group D,, and R be arotation in D,. Determine 


(a) Cp, (S) when n is odd; 
(b) Cp, (S) when n is even; 
(c) Cp, (R). 


mn 


i a i Deliver 


Letters —> P RP RPRPS PRPRPR P— Reject 


Fig. 7.15 A letter facer 
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Fig. 7.16 Snowflakes 


7.7 Quaternion Group 


Consider the case F = C, the complex numbers, and the set of eight elements 


Shale IL 3 del dh 


One can use the notation 


i 0 0 1 0 i 
ae pee Pa, 


Og = 


—_— 


189 


To avoid confusion, we may show the identity matrix by | instead of 5. Then, we 


obtain 
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Fig. 7.17 Iranian architecture 


Fig. 7.18 The rules 

involving 7, J, and K in the 

quaternion group I J 
K 


Palak ==1, 
Ija=J1 =k, 
JK =—-JK =1, 
KI=-IK=J. 


So, Og = {+1, +/, +J/, +K}. 

The rules involving J, J and K can be remembered by using Fig. 7.18. 

Going clockwise, the product of two consecutive elements is the third one. The 
same is true for going counterclockwise, except that we obtain the negative of the third 
element. This group was invented by William Hamilton in 1834. The quaternions are 
used to describe rotations in three-dimensional space, and they are used in physics. 
The quaternions can be used to extend the complex numbers in a natural way. 


Theorem 7.81 Qg is a subgroup of GL2(C) and so in particular Qs is a group of 
order 8. 


Proof Clearly, Qg is non-empty. By the above relations, it is closed under multi- 
plication. In addition, +1 are their own inverse, and all the other elements have an 
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inverse which is minus themselves (since /(—/) = 1, etc.). Therefore, Q¢ is closed 
under inverse. | 


Qs is called quaternion group. The quaternion group is not abelian. Its Cayley 
table is shown below. 


Theorem 7.82 The quaternion group Qs is generated by J and K. 


Proof We observe that each element of Qg is of the form J’ K* for some integers r 
and s. a 


Remark 7.83 Note that /, J and K have order 4 and that any two of them generate 
the entire group. 


The proper subgroups of Qg are (J), (J), (K) and (—1). We have 


and the center of Qg is (—1). 


Exercises 


1. Show that Qs has exactly one element of order 2 and six elements of order 4. 
2. Draw the Hasse diagram for subgroups of the quaternion group Qs. 


7.8 Worked-Out Problems 


Problem 7.84 Let 7 be a positive integer and 
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Prove that A” = J; if and only if 4|n. 


Solution First, we compute A’, A?, A‘, etc. We find that 


00-1 00-1 -1 0-1 
A*=| 01 0 010 f= 01 Of, 
10 0 10 0 0 0-1 
-1 0 O 0 0-1 001 
A= 01 0 010 4;= 010), 
0-1 10 0 -1 00 
001 0 0-1 100 
At = 010 01 OJ=] OLO0/=8 
-1 00 10 0O 001 


Now, if 4|n, then we can write n = 4g. Hence, A” = A“f = (A*)2 = ie =]. 
Conversely, if A” = 1/3, then by the division algorithm, there exist integers g and 
r such thatn = 4q +r with 0 <r < 4. Next, we obtain 


Al = A" = A"(A~4)1 = If = Ih. 


Since A, A? and A? are not equal to /;, it follows that r = 0. Consequently, we obtain 
Aln. a 


Problem 7.85 Let 7 be a positive integer and 


110 
A=] 011 
001 
Compute A”. 
Solution Suppose that 
010 
B=; 001 
000 


Then, we observe that B” = 0, for each n > 3. Note that A = J; + B. Since J; and 
B commute, we can use the binomial theorem. So, we can write 


A" = (5 +B)" =~ (Fg. 


i=0 
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Therefore, we obtain 


0 n(n — 1) 
ete et pel 
— 3 2 ~1 01 n ; 
0 0 1 
and we are done. |_| 


7.9 Supplementary Exercises 


1. Prove that the elements of GL,(R) which have integer entries and determinant 
equal to | or —1 form a subgroup of GL, (R). 

2. Let A= ; e ,B= i =] € GL2(Z). Show that A has order 4, B has 
order 3 and AB has infinite order. 


3. The matrix 


I ae aera 
LS ae naerd, 
2 
Dt te tea th 


is called a Vandermonde matrix. Show that such a matrix is invertible, when fo, 
ti, ...,t, aren + 1 distinct elements of C. 


4. Show by example that there are matrices A and B for which lim A” and lim B” 
noo noo 


both exist, but for which lim (AB)"” does not exist. 
n> Oo 


5. Let A= ‘ : be a matrix over a field F. Then, the set of all matrices of the 


form p(A), where p is a polynomial over F, is acommutative ring R with identity. 
If Bisa2 x 2 matrix over R, the determinant of B is then a2 x 2 matrix over F, 
of the form p(A). Suppose that B is the 2 x 2 matrix over R as follows: 


A-— aly —blh 
—clh A- dl, , 


Show that det(B) = p(A), where 
D(x) = x? — (a +d)x + det(A), (7.10) 


and also that p(A) = 0. The polynomial p in (7.10) is the characteristic polyno- 
mial of A. 


6. Given a matrix A = eo 


oa in GL2(Z,). Consider its characteristic polynomial. 
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(a) If the roots A; and Az are distinct in Z,, show that A is conjugate to both 
A, 0 rA2 0 
0 Mm and | 0 ai 
_ . A, 0 ae : Ay 1 |. 

(b) If Ay = Az, show that A is 0 > or it is conjugate to 0 al in 

GL2(Z,). 

7. Let 
Ti 112 113 


R=] ray ro 123 
31 132 133 


be a rotation matrix in R?. Show that the rotation axis is the vector 


1 132 — 123 
S| 13 — 731 

sin 0 
21 — 112 


If the angle of rotation 6 is different from 7 and 0, then 


| ee (2 + ro. +1733 — -) 
= 5 ; 


What happens if the rotation angle is very small? Describe a robust method to find 
the rotation axis (your method should include the cases for 6 = 0 and 6 = 7). 


Chapter 8 M®) 
Cosets of Subgroups and Lagrange’s cies 
Theorem 


In group theory, the result known as Lagrange’s theorem states that for a finite group 
G the order of any subgroup divides the order of G. Lagrange’s theorem is one of 
the central theorems of group theory. In order to prove this theorem we introduce the 
notion of cosets of a subgroup. 


8.1 Cosets and Their Properties 


To understand Lagrange’s theorem, one has to look at cosets of a subgroup. This 
notion was invented by Galois in 1830, although the term was coined by G.A. Miller 
in 1910. 


Definition 8.1 Let G be a group and H be a subgroup of G. For any a € G, the set 
aH = {ah | h € H} is called the left coset of H in G containing a. Analogously, 
Ha = {ha |h € H} is called the right coset of H in G containing a. 


In other words, a coset is what we get when we take a subgroup and shift it 
(either on the left or on the right). The best way to think about cosets is that they are 
shifted subgroups, or translated subgroups. Note that a lies in both aH and Ha, since 
a = ae = ea. If the left and right cosets coincide or if it is clear from the context 
to which type of coset that we are referring to, we will use the word coset without 
specifying left or right. In additive notation, we get a + H (which usually implies 
that we deal with a commutative group where we do not need to distinguish left and 
right cosets). Any element of a coset is called a representative of that coset. 


Example 8.2 Let G = Z, the additive group of integers, and H = (m) = mZ, the 
subgroup generated by m, for some m € Z. The set of left cosets is {H, 1+ H, 2+ 
HH, ..., m—1+H}.Sincea+ H = H +a, left cosets coincide with right cosets. 
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Example 8.3 Let G = S3, the symmetric group of an equilateral triangle. Take the 
subgroup H = {id, (1 3)}. Then, the left cosets are 


(12)H = {(1 2), (1 2)(1 3)} = {01 2), 1 23)} = 123), 
(1 3)H = {(1 3), (1 3). 3)} = {C1 3), id} = A, 
(23)H = {(23), (23) 3)} = {(23), 13 2)} = 13 2)H. 


Example 8.4 If G = GL,(R) and H = SL,(R), then for any matrix A € G, the 
coset AH is the set of all 2 x 2 matrices with the same determinant as A. Thus, the 


coset ; His the set of all 2 x 2 matrices of determinant 3. 
Theorem 8.5 (Properties of Cosets) Let G be a group and H be a subgroup of G. 
Then, 


(1) a€ aH; 

(2) aH = H ifand only ifa € H; 

(3) (ab)H = a(bA) and H (ab) = (Ha)b; 

(4) aH = DH ifand only ifa € bH; 

(5) aH = bH oraHN bH =§; 

(6) aH = DH if and only ifa~'b € H; 

(7) \aH| = |bH|; 

(8) aH is a subgroup of G if and only ifa € H. 


Proof (1) We have a = ae € {ah |h € H}=aH. 

(2) Suppose thataH = H.Then, a = ae ¢ aH = H. Conversely, leta ¢ H. We 
show thataH C H and H C aH. The first inclusion follows immediately from the 
closure of H. In order to show that H C aH, assume that h € H is an arbitrary 
element. Since a € H andh € H, it follows that a~'h € H. Consequently, we get 
h=eh = (aa~')h =a(a"'h) € H. 

(3) This follows directly from (ab)h = a(bh) and h(ab) = (ha)b, for allh € H. 

(4) If aH = DH, then a = ae € aH = DH. Conversely, if a € bH, then there 
exists h € H such that a = bh. So, we deduce that aH = (bh) H = b(hA) = bH. 

(5) This property follows directly from (4). Because ifaH NbH # Q, then there 
exists c € aH 1 bH. This implies that cH = aH andcH = bH,andsoaH = bH. 

(6) We observe that aH = bH if and only if H = a~'bH. Now, the result follows 
from (2). 

(7) In order to show that |aH| = |bH|, itis enough to define a one to one function 
from aH onto bH. For this, we define f : aH — bH by f (ah) = bH. Obviously, 
f is onto. Moreover, from the cancellation law, f is one to one. This shows that f 
is a one to one correspondence. 

(8) Assume that aH is a subgroup of G. Then, it contains the identity 
e. So, we get aH NeH £ &. Then, from (5) we have aH = eH = H. Now, from 
(2), we conclude that a € H. Conversely, if a © H, then again by (2), we obtain 
aH = H. | 


Remark 8.6 In Theorem 8.5, analogous results hold for right cosets. 
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Theorem 8.7 If H and K are two subgroups of a group G, then for any a,b € G 
eitheraHN bK =%oraH NbK =c(H1K), for somec € G. 


Proof Suppose that aH NbK #49, and letc eaHNbK. Since c €cH andceé 
cK, it follows thatc € aH (cH andc € bK McK. Consequently, we obtainaH = 
cH andbK =cK,andsoaH NbK =cH ‘cK. Moreover, it is not difficult to see 
that c(H 1 K) C cH (cK. On the other hand, if x € cH N cK, then x = ch = ck, 
for some h € H and k € K. This implies that h = c7'x =k € HONK. So, x € 
c(H 1 K). Therefore, we conclude thataHN bK =cHNcK =c(H(K). | 


Theorem 8.8 Let G be a group and H be a subgroup of G. If R is the set of distinct 
right cosets of H inG and L is the set of distinct left cosets of H in G, then |R| = |L|. 


Proof Consider the mapping f : R — L defined by f(Ha) = a7'H. First, we need 
to show that f is well defined. Namely, if we take two different representations of the 
same right coset we must show our mapping sends them to the same image. Suppose 
that Ha = Hb. Then, we know that Hab! = H orab™! € H.Letab7! = h, where 
h € H. Then, a~'h = b™! and so a~'hH = b“'H, or equivalently a~'H = b“'H. 
Consequently, we have f(Ha) = f (Hb) as desired. Therefore, f is well defined. 
Now, assume that f(Ha) = f(Hb). Then, a-'H =b"'H. This yields that 
ba~'H = H, or equivalently ba~! € H. If ba~' = h, where h € H, then b = ha. 
Consequently, Hb = Hha and so Ha = Hb. Thus, f is one to one. In addition, if 
aH é€ Lisan arbitrary element, then f (Ha"') = (a_')~'H = aH. This shows that 
f is onto. Therefore, f is a one to one correspondence between £ and R. a 


Exercises 


1. List the left and right cosets of the subgroup {id, (1 2)(3 4), (1 3)(24), d YQ 3)} 
in Aq. 
2. If G is a group and {H; | i € J} is a family of subgroups of G, show that 


3. Suppose that G = Qg and let H = (—1) and K = (J) be subgroups of G. Find 


(a) the left cosets of H and K in G; 
(b) the right cosets of H and K inG. 


4. Suppose that H and K are subgroups of G and there are elements a and b in G 
such that aH C bK. Prove that H C K. 

5. Let G be a group and a € G. If o(a) = 30, how many cosets of (a*) in (a) are 
there? List them. 

6. Give an example of a group G having a subgroup H and two elements a, b such 
that Ha = Hb butaH 4 bH. 

7. Find an example of a subgroup H of a group G and elements a and b in G such 
thataH £ HbandaHN Hb #%. 
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8. Suppose that H and K are subgroups of a group G with H < K. Show that for 
eacha € G,eitheraH C K oraHN K =@. 


8.2 Geometric Examples of Cosets 


When a group is defined in terms of vectors and matrices, we can often get a picture 
of the group and its cosets. 


Example 8.9 Let G = R? = {xi + yj | x, y € R}, the additive group of vectors 
in a plane, where i = (1,0) and j = (0, 1). Suppose that H = {xi | x € R} and 
v = ai + bj is a vector in G. Then, the left coset of H in G containing v is 


v+H ={(a+x)i+bj |x € R}. 


This is the line parallel to H that passes through the endpoint of v. The left cosets of 
H in G, in general, are the lines parallel to H. Two parallel lines are either equal or 
disjoint, so any two left cosets of H in G are equal or disjoint. In Fig. 8.1, the cosets 
H in G containing v and v’ are equal while those of u, v, and w are disjoint. 


Example 8.10 Let C* be the group of non-zero complex numbers (see Example 
3.36), and let H = {a+ bi | a*> +b? =1, a,b € R} be the subgroup of C*, which 
is defined in Example 3.66. First, notice that geometrically H is a circle of radius 1 
with center (0, 0). Let 4 + 37 be an element of C*. We want to study the coset of H 
in C* containing 4 + 37. Of course, C* is abelian, so there is no distinction between 
the left and the right cosets, and they are the same. The coset of H in C* containing 
4+ 3i is of the form (4+ 37)H, and it is a circle with center (0, 0) too. However, 
4+ 3i scales the equation so that 1 becomes 4? + 3? = 25. In order to see this, 


Fig. 8.1 The cosets of 
v+ A inR* 
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Fig. 8.2. H and its cosets 
2H and 4H in C* 


we have (4 + 3i)(a+ bi) = (4a — 3b) + Ga + 4b)i. So, the multiplication 4 + 37 
sends the point (a, b) € H to the point (4a — 3b, 3a + 4b). Now, this gives us the 
equation 


(4a — 3b)* + (3a + 4b)? = 16a? + 9b? — 24ab + 9a? + 16b? — 24ab 
= 25a? + 25b? = 25(a* + b*) = 25. 


Now, we consider the general case. If z € C* be an arbitrary number, we can write 
z = re’, where r is the absolute value of z and @ is the argument of z. Consider e!’ H. 
Multiplying any complex number by e’® simply rotates anticlockwise through angle 
6 about the origin. Hence, we deduce that e'? H = H. So, we obtain zH = rH. What 
does multiplying by r do? It scales the circle H by a factor of r. Two different positive 
real numbers r 4 r’ give different cosets rH #4 r’H, since the first has radius r and 
the second has radius r’. For illustration, see Fig. 8.2. Therefore, the cosets of H in 
C* are the circles centered at the origin of positive radius. 


Example 8.11 Let G = {(a, b)| a,b € Randa > 0} be the group defined in Exam- 
ple 3.42 with the following binary operation: 


(a, b) x (c,d) = (ac, bc +d), 
for all (a, b), (c,d) € G. Suppose that 


H=({(1,y)|yeR}, 
K = {(x,0) |x € Rand x > 0}. 


It is easy to check that H and K are subgroups of G. These subgroups are indicated in 
Fig. 8.3. Now, let (a, b) € G. What are the left and right cosets of H in G containing 
(a, b)? We observe that 
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Fig. 8.3. Subgroups H 
and K 


Fig. 8.4 Left and right 
cosets of H in G containing 
(a, b) 


(a,b), y) = (a,b + y) and (1, y)(a, b) = (a, ay + 5), 


for all (1, y) € H. Clearly, the numbers b + y and ay + b run over R. This yields 
that the left and the right cosets of H in G containing (a, b) are the same. In other 
words, we have 

(a, b)H = H(a,b) = {(a,z)|z € R}, 


and this is the vertical line parallel to H passing through the point (a, b), see Fig. 8.4. 
Now, we determine the right coset of K in G containing (a, b). We have 


(x, O)(a, b) = (xa, b), 


for all (x, 0) € K. Since x > 0, it follows that (xa, b) runs through all points of the 
form (z, b) with z > 0. Consequently, we can write 
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(a, b)K 


ik 


Fig. 8.5 Left and right cosets of K in G containing (a, b) 


K(a, b) = {(z, b) | z € Rand z > O}. 


In view of this, the right coset of K in G containing (a, b) is determined by b alone 
and it is independent of the choice of a. So, it is horizontal line through (a, b). See the 
left hand picture in Fig. 8.5. Finally, we try to find the left coset of K in G containing 
(a, b). We have 

(a, b)(x, 0) = (ax, bx), 


for all (x, 0) € K. If we take ax =r, then bx = br/a. So, we can write 
b 
(a,b)K = \(r,5)|r,s €R, r>0, ands = -r}. 
a 


This is a half line out of the origin with slope b/a. See the right hand picture in 
Fig. 8.5. We observe that the left and right cosets of K in G containing (a, b) are not 
the same if b A 0. 


Example 8.12 We know that G = R? forms a group under the following binary 
operation: 


(x1, Y1, 21) + 2, ya, Z2) = 1 + X2, V1 + V2, 21 + 22). 
If we consider the subset 
H = {(x, y,z) | 2x +7y — 3z = 0}, 
then we observe that H is a subgroup of G. In geometrical terms, H is a plane through 
the origin in a three-dimensional space. The left coset (a, b, c) + H is, in general 


terms, nothing more than the plane P passing through the point (a, b, c) and parallel 
to H in R°. For illustration, in Fig. 8.6 we consider (a, b, c) = (0, 2, 0). 
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Fig. 8.6 The left coset 


Zn 
(0, 2,0) + H is the plane P 
passing through the point 
(0, 2, 0) and parallel to H 
H Fad 
? 
O (0, 2, 0) y 


Exercises 


1. In R* under component addition, let H = {(x, 5x) | x € R}, the subgroup of all 
points on the line y = 5x. Show that the left coset (3, 2) + H is the straight line 
passing through the point (3, 2) and parallel to the line y = 5x. 

2. Consider the additive group R and its subgroup Z. Describe a coset t + Z geo- 
metrically. Show that the set of all cosets of Z in R is {t+ Z| 0 < t < 1}. What 
are the analogous results for Z* C R?? 

3. Show that the function sine assigns the same value to each element of any fixed 
left coset of the subgroup (2:7) of the additive group R of real numbers. Thus sine 
induces a well-defined function on the set of cosets; the value of the function on 
a coset is obtained when we choose an element x of the coset and compute sin x. 


8.3. Lagrange’s Theorem 


Let H be a subgroup of a group G, which may be of finite or infinite order. We 
reconsider some parts of Theorem 8.5 in a different view. We exhibit two partitions 
of G by defining two equivalence relations, =; and =z on G. 


Theorem 8.13 Let H be a subgroup of G. Let the relation =, be defined on G by 
a=, b(mod H) @a7'be H. 


Also, let the relation =r be defined on G by 
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a =p b(mod H) Sab"! € H. 


Then, =; and =p are both equivalence relations on G. 


Proof We show that =, is an equivalence relation, and leave the proof for =p as 
exercise. We must verify the following conditions, for all a, b,c € G, 


(1) a =; a(mod #); 
(2) a =, b(mod A) implies b =; a(mod HA); 
(3) a =, b(mod H) and b =; c(mod H) imply a =; c(mod #). 


Now, we prove the above items. 

(1) Since H is a subgroup, it follows that a~!a = e ande € H, which is what we 
were required to demonstrate. 

(2) Suppose that a =; b(mod H). Thena~'b € H. Since H is a subgroup, it fol- 
lows that (a~'b)~! € H. But (a~'b)“! = b“!(a~!)-! = ba. Hence, b-'a € H 
and b =; a(mod #). 

(3) Finally, we require that a =; b(mod H) and b =; c(mod A) force a =; 
c(mod H). Indeed, we have a~'b € H and b-!ce H. Again, since H is a sub- 
group, we conclude that (a~'b)(b-!a) = a~'c € H, from which it follows that 
a =, c(mod A). a 


If [a]; and [a]r are the equivalence classes containing a related to =, and =r, 
respectively, then we have [a], = aH and [a]r = Ha. 
The left cosets of H in G define a partition of G, i.e., 


(1) For eacha € G,aH £#9@; 


(2) Foranya,b € G,aH = bH oraHN bH = 9; 
(c) G= U ad. 


acG 


We can now prove the theorem of Lagrange. 


Theorem 8.14 (Lagrange’s Theorem) Let H be a subgroup of a finite group G. Then 
the order of H is a divisor of the order of G. 


Proof Let us start by recalling that the left cosets of H forms a partition of G. Since 
G is finite, it follows that there exists a finite number of disjoint left cosets, namely 
a,H, a2H,..., a,H. So, we have 


G=a,HUaHU---Ua,H anda;H Na;H = 9, for alli ¥ j. 


Let us look at the cardinality of G. We can write 


n 
|G| = |a,H U@HU---Ua,H| => ° |a:H\. 


i=1 


By Theorem 8.5 (7), the cosets of H in G all have some size of H, i.e., |a; 1| = |H|, 
for all 1 <i <n. This yields that 
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IG) =|H| + Al +---+|4] =nlAI. 
——— — 


n times 
This shows that the order of H is a divisor of the order of G. |_| 


There are several corollaries of Lagrange’s theorem. 


Corollary 8.15 [f G is a finite group and a € G, then o(a)||G}. 


Proof Since (a) is a subgroup of G, then the order of (a) is a divisor of order G. By 
Corollary 4.27, we know that o(a) = |(a)|. This shows that o(a)||G|. a 


Corollary 8.16 Jf G is a finite group anda € G, thena'"! = e. 


Proof By Corollary 8.15, we have o(a)||G|. Thus, there is a positive integer g such 
that |G| = o(a)q. Consequently, we have 


qQiGl = qed = (a?™)4 = el =e, 


as desired. |_| 


Corollary 8.17 (Euler’s Theorem) Ifa and n are positive integers such that (a, n) = 
1, then a? = 1(mod n). 


Proof Let U,, be the group of units of Z, under multiplication modulo n. By Corollary 
4.7, we have |U,,| = y(n). So, by Corollary 8.16, we have a?” = 1, for alld € U,, 
which in turn translates into n|ja?) — 1, or equivalently a?” = 1(mod n). a 


Remark 8.18 In Problem 1.74, we realized a proof for Fermat’s little theorem. Since 
¢(p) = p — 1, for any prime p, we can conclude Fermat’s little theorem from Euler’s 
theorem too. 


Corollary 8.19 Every group of prime order is cyclic. 


Proof Assume that G is of prime order p, and let a € G be an element different 
from the identity. Then, the cyclic subgroup (a) of G generated by a has at least two 
elements, a and e. But by Lagrange’s theorem, the order m > 2 of (a) must divide 
the prime p. Consequently, we must have m = p and (a) = G. This shows that G 
is cyclic. | 


Corollary 8.20 Let G be a group and p be a prime number. If G has exactly r 
subgroups of order p, then it has r(p — 1) elements of order p. 


Proof In each subgroup of order p, all non-identity elements have order p. In addi- 
tion, an element of order p generates a subgroup of order p. By Lagrange’s theorem, 
the intersection of distinct subgroups of order p is trivial subgroup, so their non- 
identity elements are disjoint from each other. Consequently, each subgroup of order 
p has its own p — 1 elements of order p, not shared by the other subgroups of order 
p. Therefore, the number of elements of order p is r(p — 1). a 
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The following natural question arises: Is the converse of Lagrange’s theorem true? 
That is, if d is a divisor of the order of G, then does G necessarily have a subgroup 
of order d. The standard example of the alternating group Ay, which has order 12 
but has no subgroup of order 6, shows that the converse of Lagrange’s theorem is not 
true. 


Theorem 8.21 The group A4 of order 12 has no subgroups of order 6. 


Proof To verify this, recall that A, has eight elements of order 3 and suppose that 
His a subgroup of order 6. Let o be any element of order 3 in Ay. If o is not 
in H, then Ag = H UodH. But then o? € H or 02 EoH. If o” € A, then so is 
(a7)? = o* =o, so this case is ruled out. If 0? € oH, then o* = o/h, for some 
h € H, but this also implies that o ¢ H. This argument shows that any subgroup of 


A, of order 6 must contain all eight elements of A, of order 3, which is absurd. 1 
Theorem 8.22 [fn > 4, then A, has no subgroup of order n\/4. 


Proof If H is a subgroup of order n!/4, then H has only two left cosets in Ay. 
So, if o is a cycle of length 3 in A,, then the cosets H, oH, and oH cannot 
be all distinct. Equality of any two of the above cosets implies either o € H or 
o? € H. Now, o” € H implies o = o4 € H. Thus, H contains all cycles of length 
3. Now, since A, is generated by cycles of length 3, it follows that H = A,, a 
contradiction. a 


Remark 8.23 Theorem 4.31 expresses that the converse of Lagrange’s theorem 
holds for any finite cyclic group. 


Theorem 8.24 Let G be a group of order n. If G has at most one cyclic subgroup 
of order d where d\n, then G is cyclic. 


Proof For each divisor d of n, let 6g(d) denote the number of elements of G of 
order d. For a given positive integer n such that d|n, let 6g(d) 4 0. Then, there 
exists a € G of order d which generates a cyclic group (a) of order d of G. We prove 
that all elements of G of order d belong to (a). If x € G is an element such that 
o(x) =d and x ¢ (a), then (x) is another subgroup of order d such that (a) € (x). 
This contradicts the hypothesis. Consequently, if 0g(d) 4 0, then 6g¢(d) = g(d), 
for all positive integer d|n. In general, we can write 6g(d) < ¢(d), for all positive 
integer d|n. So, we obtain 


n=) 6c(d) < )) pd) =n. 


d\n d\n 


This yields that 0¢(d) = g(d), for all dln. In particular, we have 0g(n) = 
y(n) > 1. Hence, there exists at least one element of G of order n. This shows 
that G is cyclic. | 
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Exercises 


1. Show that a group with at least two elements but with no proper non-trivial 
subgroups must be finite and of prime order. 

2. Suppose that H and K are unequal subgroups of a group G, each of order 16. 
Prove that 24 <|H UK| < 31. 

3. Fora group G and a subgroup H of G, let Ha; U Hay U--- U Ha, be adecom- 
position of G into disjoint right cosets of H in G. Show that a,|H Ua, 'HU 
---Ua,'H is a decomposition of G into left cosets of H in G. 

4. Let G be a finite group and let H and K be subgroups with relatively prime 
order. Prove that HM K = fe}. 

5. Show that if H and K are subgroups of a group G, and have orders 56 and 63, 
respectively, then the subgroup H 1 K must be cyclic. 

6. Suppose that K is a proper subgroup of H and H is a proper subgroup of G. If 
|K| = 42 and |G| = 420, what are the possible orders of H? 

7. Use Fermat’s little theorem to show that if p = 4n + 3 is prime, there is no 
solution to the equation x* = 1(mod p). 

8. If G is a finite group with fewer than 100 elements and G has subgroups of 
orders 10 and 25, what is the order of G? 

9. Let G be a group anda, b € G. If a> = e and aba“! = b’, find o(b) if b # e. 

10. Does As contain a subgroup of order m for each factor m of 60? 

11. Let G be an abelian group of order 2n with n odd. Prove that G has precisely 
one element of order 2. 

12. Let G be a finite group. Show that the following conditions are equivalent: 


(1) G is cyclic; 

(2) For each positive integer d, the number of a € G such that a“ = ¢ is less than 
or equal to d; 

(3) For each positive integer d, G has at most one subgroup of order d; 

(4) For each positive integer d, G has at most g(d) elements of order d. 


13. Let G be a finite group and H be a subgroup of G. Let /(a) be the smallest 
positive integer m such that a” € H. Prove that /(a)|o(a). 

14. Let n be a positive integer and let m be a factor of 2n. Show that D, contains a 
subgroup of order m. 

15. Using Lagrange’s theorem, show that the binomial coefficient 


n = n! 
i) ~ min — my!’ 


for every integers m and n withn > 1 and 0 < m < n, is an integer. 
Hint: Consider a subgroup of S$, of order m!(n — m)!. 
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8.4 Index of Subgroups 


A special name and notation have been adopted for the number of left (or right) 
cosets of a subgroup in a group. 


Definition 8.25 Let H be a subgroup of a group G. The number of distinct left (or 
right) cosets of H in G is the index of H in G. The index of H in G is denoted by 
[G: H]. 


Note that if G is finite, then the index [G : H] divides |G|. 


Example 8.26 Consider Example 8.3, and let H = {id, (1 3)} be the subgroup of 
S3. Then, we observe that [$3 : H] = 3. 


Example 8.27 A left coset of SL2(F) in GL2(F) has the form X SL2(F)={XA|A € 
SL>(F)}, where X € GL>(F).Ifdet(X) = c, thendet(X A) = c. Therefore, all matri- 
ces in X SL>(F) have the determinant equal to c. Conversely, let B € GL2(F) such 
that det(B) = c. Take A = X~'B, then B = XA. On the other hand, det(A) = 
det(X~!B) = det(X~') det(B) = c~'c = 1. This means that A € SL2(F). So, we 
deduce that B € XSL2(F). Consequently, we have XSL2(F) = {A | A € GL2(F) 
and det(A) = c}. Hence, each left coset of $L.(F) in GL(F) has the above descrip- 
tion, for some non-zero elementc € F. This shows that [GL2(F) : SL2(F)] is infinite. 


Theorem 8.28 If H and K are subgroups of a group G such that K < H, then 
[G:K]=[G: A][H: K]. 


Proof Suppose that G = |) a;H, the union of distinct left cosets of H in G, where 
ie] 
1} =[G: H], and let H = U b;K, the union of distinct left cosets of K in H, 
jes 
where |J| = [H : K]. Then, we can write 


6=Uatt= Ua (Yor) = LU ajb)K. 


iel iel jes Gi, jyelxJ 


Now, we claim that a;b;K’s are distinct. Assume that ajb; K = a,b; K, for some 
i,k € Tandj,l € J.Sinceb;K C HandbK C H, itfollows thata;H Na,H # 9. 
This implies that a; H = a, H, and soi = k. Then, we obtain b; K = b, K, and hence 
j =!. Therefore, we deduce that a;b; K’s are distinct left cosets of K in G. This 
yields that[G: K] =[G: A][H: K]. | 


Theorem 8.29 (Poincaré Lemma) If H and K are two subgroups of finite index in 
a group G such that [G : H] =m and [G: K] =n, then H( K is also of finite 
index in G and [m,n] <[G: HM K] < mn. In particular, ifm and n are relatively 
prime, then[G : HM K] = mn. Moreover, if G is finite, thenG = HK. 
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Proof Let a,H,a.H,...,d,H and b,K, bo K,...,b,K be distinct left cosets of H 
and K in G, respectively. Now, by Theorem 8.7, forevery 1 <i<mandl<j<n 
eithera;H 1b;K = @orajH1b;K =c;j(H 0 K), forsome c;; € G. On the other 
hand, x(H 1 K) = xH 1 xK. Therefore, each left coset of H M K is determined by 
intersection of a left coset of H and a left coset of K in G. Consequently, distinct 
number of left cosets of HM K is at most equal to mn. 

Now, by Theorem 8.28, we can write 


IG: HOK\=(G:A\J[H:HOK)=m[(A: AK], 
IG: HOK|=(G: K|[K:HOK)=n[K:ANK, 


and hence m|[G : HM K]andn|[G : HN K]. Therefore, we conclude that the least 
common multiple of m and n divides [G : HN K]. | 


Corollary 8.30 In Theorem 8.29, if m and n are relatively prime, then [G: HM 
K]=man. 


Proof It is straightforward. | 


Theorem 8.31 /f H and K are subgroups of a group G, then 
[H:HOK)<([G: K]. 


Proof Suppose that A is the set of all disjoint left cosets of HM K in H and B 
is the set of all disjoint left cosets of K in G. We define f : A > B by f(h (AN 
K)) =hK, for al he H. If h(HN K) =h'(H 14K), for some h,h' € H, then 
h-'h' © HONK CK. This implies that hK = h'K. So, we conclude that f is well 
defined. Moreover, if hK = h'K, for some h, h' € H,thenh-'h' € K. Since H isa 
subgroup andh, h' € H, it follows that h—'h’ € H. Hence, wehaveh7'h' ¢e HN K, 
which implies that h(H N K) = h’'(H NN K). Therefore, f is one to one. This yields 


this |A| < |B], or equivalently [HW : HN K] <[G: K]. | 
Exercises 
1. If G is an infinite cyclic group and {e} 4 H < G, prove that [G : H] is finite. 


2. Show that the integers have infinite index in the additive group of rational numbers. 
3. Suppose that G is a finite group and H, K are subgroups of G such that K C H 
and [G : K] is prime. Prove that H = G. 
4. Determine the index of the following subgroups in the corresponding groups: 
(a) {0, 3, 6, 9} in Zi; 
(b) Rin (C, +); 
(c) 3Z in Z. 
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5. Show that if H is a subgroup of index 2 in a finite group G, then every left coset 
of H is also a right coset of H. 

6. Two subgroups H and K ofa group G are said to be commensurable if H 0 K is 
of finite index in both H and K. Show that commensurability is an equivalence 
relation on the subgroups of G. 


8.5 A Counting Principle and Double Cosets 


Let A and B be two subsets of a group G. The set AB = {ab | ae A, be B} 
consisting of the products of elements a € A and b € B is said to be the product of A 
and B. The associative law of multiplication gives us (AB)C = A(BC) for any three 
subsets A, B, and C. The product of two subgroups is not necessarily a subgroup. 
We have the following theorem. 


Theorem 8.32 Let H and K be two subgroups of a group G. Then, the following 
two conditions are equivalent: 


(1) The product HK is a subgroup of G; 
(2) HK=KH. 


Proof Suppose that H K is a subgroup of G. Then, for any h €¢ H andk € K, we 
have h—'k—-! € HK and so kh = (h-'k~')"! € HK. Thus, KH C HK. Now, if 
x is any element of HK, then x-!' =hk € HK, for some h € H andk € K. So, 
we obtain x = (x~!)~! = (hk)“! = k“'h! € KH. This means that HK C KH. 
Thus, we conclude that HK = KH. 

On the other hand, suppose that HK = KH, 1.e., if he H andke K, then 
hk = kh, for some h; € H and k; € K. In order to prove that HK is a subgroup 
of G, we must verify that it is closed and every element in HK has its inverse 
in HK. Suppose that x = hk € HK and y =h’k' € HK, where h,h' € H and 
k,k’ € K. Then, we have xy = hkh’k’, but since kh’ € KH = HK, it follows that 
kh' = hyky withh, € H andky € K. Hence, xy = h(h2ky)k’ = (hha) (kok’) € HK. 
Clearly, x~! = k-'h-! € KH = HK. Consequently, HK isa subgroup of G. 


Corollary 8.33 If H and K are subgroups of the abelian group G, then HK is a 
subgroup of G. 


Theorem 8.34 [f H and K are finite subgroups of a group G, then 


_ IAIK! 
|HK| = ———. 
|H K| 
Proof Suppose that A = HN K, and let the index of A in H is n. Then, we can 
write H = h,AUhj,AU---Uh,A, in which h;ANh;A = Y, for any i A j. On 
the other hand, we have HK =h,AK Uh, AK U---Uh,AK. Since AK = K, it 
follows that HK = h,K Uh) K U---Uh,K. We claim that h;K 1h; K = 9, for 
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any i # j. Indeed, ifh;K =h;K ¢ 9, for somei  j, then there exist ki, kz € K 
such that hjk; = hjko, and then hjh;! = kyk;' € A. This implies that hj A = hj A, 
a contradiction. Therefore, we obtain |H K| = n|K|. Since n = |A|/|A|, it follows 
that |HK| = |A||K|/|HN KI. 


Corollary 8.35 If H and K are finite subgroups of a group G such that min{|H\, 
|K|} > /|G|, then HN K & {e}. 


Proof Since HK C G, it follows that |H K| < G. On the other hand, by Theorem 
8.34, we have 


UIK]  VIGIVIGI _ _ Gl 


IG| > |HK|= = 
IHNK| |HOK| |ANK| 


So, we conclude that |H M K| > 1. This yields that HM K # {e}. | 


Definition 8.36 Let G be a group and H, K be two subgroups of G and let x € G. 
Then, the set Hx K = {hxk |h € H andk € K} is called a double coset of H and 
KinG. 


Lemma 8.37 For any x, y € G, two double cosets Hx K and HyK are either dis- 
joint or identical. 


Proof Suppose thata € Hx K 1 HyK.Then,a = hxk = h'yk’,forsomeh, h’ € H 
and k, k’ € K. This implies that x = h7'h’yk’k7! and y = h''hxkk'~'. Now, we 
show that HxK = HyK.Letu € HxK be an arbitrary element. Then, u = h,xk,, 
for some h; € H and k; € K. Consequently, we obtain u = hyh7'h'yk’k-'k, € 
HyK. Conversely, if v € HyK is an arbitrary element, then v = hz ykz, for some 
hy € H and ky € K. Thus, we have v = hah! 'hxkk''ky € HxK. = 


Let x, y € Ganddefine x ~ yifand only ifx = hyk,forsomeh € Handk € K. 
It can be shown easily that ~ is an equivalence relation and the equivalence class of 
xis HxK. 


Lemma 8.38 /f H and K are subgroups of a finite group G and x € G, then 


|H||K| 


egies 2, 
Ix !HxN K| 


Proof Itis easy to see that the function f : HxK — x~!HxK defined by f (hxk) = 
x—'hxk, for all h € H and k € K, is a one to one correspondence. Hence, we 
deduce that |HxK| = |x~'HxK| = |x7'Hx||K|/|x7! Hx N K|. Since |x7!Hx| = 
|H|, the result follows. a 


Theorem 8.39 If H and K are subgroups of a finite group G, then 


|H||K| 
G| = —_____.,, 
IG| pare 
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where summation on right side is taken over elements x chosen from disjoint double 
cosets Hx K. 


Proof For each x € G, we have x € HxK. In view of this, we conclude that G = 


L) Hx K.If we write disjoint union, then we get |G| = }- |Hx K|. Hence, by Lemma 
xEG 
8.38, the result follows. |_| 


Lemma 8.40 Let H and K be subgroups of a group G anda € G. Then, the double 
coset HxK is a union of some right cosets of H in G, and also a union of some 
left cosets of K in G. Moreover, the number of distinct right cosets of H in Hx K 
is [K : K \x~!Hx], and the number of distinct left cosets of K in HxK is [H: 
HOx7!Kx]. 


Proof Clearly, we have 


HxK = (J Axk and HxK = | hxkK. 
keK heH 


Now, assume that k, k’ € K. Then, we have 


Hxk = Hxk’ © (xk)(xk’)"! € H & xkk’'x'€H 
& kk’! ex Hx & kk’! € KN. x7'Hx (since k, k’ € K) 
& k(K x7! Hx) =k (K Nx7'Hx). 


This shows that the number of distinct right cosets of Hin Hx K is[K : KM x! Hx]. 
The proof of the last part is similar. | 


Theorem 8.41 Let H and K be subgroups of a group G. If [G : H] is finite, then 


[G: H]= IK >KNx7'HAx], 
xeEX 


where X is the set of representatives of distinct double cosets. Similarly, if [G : K] 
is finite, then 
[G: K]= DDE: >HOx7' Kx], 
xeX 
Proof We know that 
G= U Axk. 


xeX 


By Lemma 8.40, for any x € X, the number of distinct right cosets of H in Hx K is 
[K : K 1 x~'Hx]. Hence, the number of total distinct right cosets of H in G is 


IG: H)=> [kK nx He), 
xEXx 


The proof of the second part is similar. | 
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Exercises 


1. Give an example of a group G and subgroups H and K such that HK is nota 
subgroup of G. 

2. Let H, K,and N be subgroups of a group G such that H < K, HN N= KON 
and HN = KN. Show that H = K. 

3. If A and B are non-empty finite subsets of a group G, prove that G = AB or 
|A| + |B] < |GI. 

4. (a) (Dedekind’s Modular Law). Let H, K, and N be subgroups of a group G 

and assume that K < N. Prove that 


(HK)ON =(HON)K. 
(b) In particular, if H and K permute, prove that 
(H,K)ON=(AON,K). 
(c) Can you give an example to show that (HK) WN is not equal to (HN 
N)\(K NN)? 


e Find the number of left cosets of K which are contained in the double coset 
AXxK. 

e Let H, K be two subgroups of G. Prove that every subgroup of G containing 
both A and K contains the product sets HK and KH. 


8.6 Worked-Out Problems 


Problem 8.42 Let G = (a) be an infinite cyclic group and let H = (a“), where k is 
a positive integer. Show that 


H, aH, @H,..., a 'H (8.1) 


is a complete list repetition free of the left (right) cosets of H in G. 


Solution The problem is subdivided into proving that 


(1) The list (8.1) is complete, i.e., every element of G belongs to one of the left 
cosets listed; 
(2) The list (8.1) is repetition free. 


(1) Suppose that x is an arbitrary element of G. Then, there exists positive integer n 
such that x = a”. Now, by the Division algorithm, we can writen = gk + r, for some 
integers g and r with 0 <r <k. So, we have x = a" = al" =a'(a*)1 ca’ F. 
Since 1 <r < k —1, it follows that aH is one of the left cosets in the list (8.1). 
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Therefore, every element of G lies in one of the left cosets in the list (8.1). 
(2) Suppose (with a view to obtaining a contradiction) that a’ H =a/H with 
0 <i < j <k-—1. Then, we get (a')~'a/ € H, or equivalently a/~' € H. Hence, 


we conclude that a/~’ = a’"*, for some integer m. Since a has infinite order, it follows 
that 7 — i = mk. This is acontradiction sinceO < j —i < k. From this contradiction 
it follows that the list (8.1) is repetition free. a 


Problem 8.43 Let d(G) be the smallest number of elements necessary to generate a 
finite group G. Prove that |G| > 27). Note that by convention d(G) = Oif |G| = 1. 


Solution We do the proof by mathematical induction. If d(G) = 1, then |G| > 2, 
because each non-identity element has order of at least 2. Suppose that if a 
group is generated by n — 1 elements, then its order is at least 2”~!. Now, let 
G = (aj, a2, ..., Gy), i.e., G is generated by n elements. Then, the subgroup 
H = (aj, a, ..., Gn—1) iS a proper subgroup of G, and hence by assumption we 
have |H| > 2”—!. Since a, ¢ H, it follows that a, H M H = @. In addition, we have 
da, H UH C G. Consequently, we obtain 


|G| > la, U A| = |anH| + || > 2)H|= 2-2") = 2", 


and we are done. |_| 


Problem 8.44 Prove that a group has exactly three subgroups if and only if it is 
cyclic of order p?, for some prime p. 


Solution Suppose that G is a cyclic group of order p?. By Theorem 4.31, G has a 
unique subgroup H of order p. Therefore, the subgroups of G are {e}, H and G. 

Conversely, assume that G is a group which has exactly three subgroups. Then, 
we conclude that there exists only one non-trivial proper subgroup H of G. Since H 
has no non-trivial subgroup, it follows that H is a group of order p, for some prime 
p.Let H = (a). Since G ¥ H, it follows that there exists b € G — H. Now, (b) isa 
subgroup of G different from H. Hence, we conclude that (b) = G. This means that 
G is cyclic, and it has a subgroup of order p. Thus, G is finite and the only prime 
divisor of |G| is p. Consequently, |G| must be p”, otherwise G has a subgroup from 
other divisors. 


Problem 8.45 Let H and K be subgroups of a group G, and suppose that L is a 
left coset of H in G and R,, R> are two right cosets of K in G. If LM R; 4 @ and 
LO R>2 # G, prove that |LN R,| = |Z Rol. 


Solution Assume that a € LM R; and b € LM R>. Then, we have L = aH = bH, 
R, = Ka and R2 = Kb. So, we obtain 


LOR, =aH Nl Ka = (aHa!NB)a, 
LOR, =bHN Kb= (bob! K)b. 


Since aH = bH, it follows that a~'!b € H or b~!a € H. This shows that Ha~! = 
Hb-', Consequently, we can write LAR; = (aHa~'!NK)a and LOR) = 
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(aHa'M K)b. Since aHa~' 1 K is a subgroup of G, it follows that LM R; and 
LO R2 are right cosets of aHa~!N K inG. This forces |LARi|=|LAR2|. 


Problem 8.46 Let G be a group of order 100 that has a subgroup H of order 25. 
Prove that every element of G of order 5 is in H. 


Solution Suppose that a is an element of G of order 5. Then, by Theorem 8.34, we 
can write 
(a)||H| 5-25 125 


\(a)H| = J = = 
ay HH} (ay) [ayn 


Since (a)H C G, it follows that |(a) H| < 100. This forces |(a) N H| > 1. Since 
\(a)| = 5 and (a)N A C (a), it follows that |(a) H| = 5. This yields that |(a) H| = 
(a), and hence we conclude that a € H. | 


Problem 8.47 Prove that if G is a finite group, the index of Z(G) in G cannot be 
prime. 


Solution Suppose that [G : Z(G)] = p, where p is a prime. Since p > 1, it follows 
that G ~ Z(G), and so there exists a € G — Z(G). If Cg(a) is the centralizer of a 
in G, then Z(G) is a subgroup of Cg(a). Sincea € Cg(a) anda ¢ Z(G)), it follows 
that Cg(a) 4 Z(G). Hence, we conclude that 


[Cg(a) : Z(G)] > 1. (8.2) 

On the other hand, by Theorem 8.29, we have 
p=(G: Z(G)]=[G: Cg@]lCe(@) : Z(G)]. (8.3) 
Since p is prime, by (8.2) and (8.3), it follows that [G : Cg(a)] = | and [Cg(a): 
Z(G)] = p. This shows that G = Cg(a). It means that every element of G commutes 


with a, and itis acontradiction asa ¢ Z(G). Therefore, we conclude that [G : Z(G)] 
cannot be prime. a 


8.7 Supplementary Exercises 


1. Let G be a finite group whose order is not divisible by 3. Suppose that (ab)? = 
a®b°, for alla, b € G. Prove that G must be abelian. 

2. Letn =n, +----+7n, be a partition of the positive integer n. Use Lagrange’s 
theorem to show that n! is divisible by 


r 
| [xt 
i=1 


8.7 
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. Let G be a finite abelian group and let m be the least common multiple of the 


orders of its elements. Prove that G contains an element of order m. 


. Let G be a finite group. Prove that the number of elements x € G such that 


x’ =e is odd. 


. If Gis a finite group with precisely 2 conjugacy classes, prove |G| = 2. 
. Ifo € S, has order p, p is a prime, and (n, p) = 1, show that ro = r, for some 


l<r<n. 


. If in a group G of order n, for each positive integer m|n, the equation x” = e 


has less than m + g(m) solutions, then show G is cyclic. 


. In acyclic group of order n, show that for each integer m that divides n there 


exist g(m) elements of order m. 


. Suppose that G is an abelian group with an odd number of elements. Show that 


the product of all of the elements of G is the identity. 


. Let G be a group such that |G| < 200. Suppose that G has subgroups of order 


25 and 35. Find the order of G. 


. Let G be a group of order pgr, where p, q, and r are distinct primes. If H and 


K are subgroups of G with |H| = pq and |K| = qr, prove that|HM K| =q. 


. Let G be the set of all matrices of the form 


2* p(x) 
0 1 , 
where k € Z and p(x) is any polynomial with rational coefficients. Show that 


(a) G is a group under matrix multiplication; 

(b) There exists a subgroup H of G and an element u € G such that the left coset 
uf contains an infinite number of right cosets of H in G; 

(c) The chain H C uHu7! Cu?Hu~ C.--- is an infinite proper ascending 
chain. 


. Let G be an abelian group and suppose that G has elements of orders m and 


n, respectively. Prove that G has an element whose order is the least common 
multiple of m and n. 


. In Theorem 8.31, show that equality holds if and only ifG = HK. 
. Let H be a subgroup of a group G such that [G : H] = p, where p is a prime 


number. If H is a subgroup of Z(G), prove that G is abelian. 


. Let G be an abelian group of order n and a),...,a, be elements of G. Let 


X = ad... dy. Show that 


(a) If G has exactly one element of order 2, then x = b; 
(b) If G has more than one element of order 2, then x = e; 
(c) Ifn is odd, then x = e. 


Let H and K be subgroups of a finite group G. Show that 


(a) [H: HO K|]<[HVK: K],where HV K =(HUK); 
(b) If[G: K] <2}/H: HN K],thnG=HvkK. 
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18. 


19. 


20. 


21. 


8 Cosets of Subgroups and Lagrange’s Theorem 


Let n > 1. Show that there exists a proper subgroup H of S, such that [S, : 
H\ <n. 


Using Lagrange’s theorem, for any integers m and n, prove that 
(mn)! (mn)! 
(m!)" (m!)"n! 


are integers. 

Hint: Consider the integers | to mn ina family of consecutive integers as follows: 
{1, ..., m},{m+1, ..., 2m},..., {m—Dm+1, ..., nm}. The elements 
of symmetric group S,,,, that move each set within itself is a subgroup of Sinn. 
Let G be a group and H be a subgroup of G. Prove that 


U a! Ha| 2Y1GISI1G Sk: 
acéG 


Suppose that G has exactly m subgroups of order p, where p is a prime number. 
Show that the total number of elements of order p in G is m(p — 1). Deduce 
the following results: 


(a) A non-cyclic group of order 55 has at least one element of order 5; 
(b) A non-cyclic group of order p? has altogether p + 3 subgroups. 


Chapter 9 ®) 
Normal Subgroups and Factor Groups cies 


The set of cosets of a subgroup H has no group structure. We are now interested in 
a criterion on H to give the set of its cosets a group structure. In this chapter, we 
introduce the concept of normal subgroups and we form a group of cosets, say factor 
group. A factor group is a way of creating a group from another group. This new 
group often retains some of the properties of the original group. 


9.1 Normal Subgroups 


There is one kind of subgroup that is especially interesting. If G is a group and H is 
a subgroup of G, it is not always true thataH = Ha for alla ¢€ G. There are certain 
situations where this does hold, however, and these cases turn out to be of critical 
importance in the theory of groups. It was Galois, who first recognized that such 
subgroups were worthy of special attention. 


Definition 9.1 Let G be a group and N be a subgroup of G. We say that N is a 
normal subgroup of G, or that N is normal in G, if we have aNa~! CN, for all 
aeG. 


Equivalently, a subgroup N of Gis normalifaxa~! € N,foralla € Gandx € N. 
Note that, also, we can say a~'xa7! € N, foralla € Gandx EN. 


Example 9.2 Clearly, the trivial subgroups G and {e} of a group G are normal 
subgroups. 


Example 9.3 If G is an abelian group, then all its subgroups are normal. 
Example 9.4 The center of a group G forms a normal subgroup of G. 
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Example 9.5 Consider the dihedral group D,, generated by R and S with R” = Id, 
S* = Id, and SRS = R™'. If N is the subgroup of rotational symmetries, then N is 
normal in D,,. 


We denote it N <1 G, or N <] G when we want to emphasize that N is a proper 
subgroup of G. The condition for a subgroup to be normal can be stated in many 
slightly different ways. 


Definition 9.6 Let G be a group and X be a non-empty subset of G. The set 
Ng(X) = {ae G|aXa7! = X} 
is called the normalizer of X in G. 


Theorem 9.7 If G is a group and X is a non-empty subset of G, then Ng(X) is a 
subgroup of G. In particular, if H is a subgroup of G, then H < Ng(A). 


Proof It is straightforward. | 


Theorem 9.8 Let N be a subgroup of a group G. The following conditions are 
equivalent: 


(1) N is anormal subgroup of G; 

(2) aNa~! =N, foralla € G; 

(3) aN = Na, foralla € G; 

(4) Every left coset of N in G is also a right coset of N in G. 


Proof (1 => 2): Let N be a normal subgroup of G. Then, for each a € G, we have 
aNa! CN. IfaeG, then a! € G, and so a~!N(a!)“! =a7!Na C N. This 
yields that N C aNa7'. Thus, we conclude that aNa~! = N. 

(2 = 3): Let a € G be arbitrary. Since aNa~! = N, it follows that aNa~'a = 
Na, or equivalently aN = Na. 

(3 => 4): It is straightforward. 

(4 = 1): Suppose that every left coset of N in G is a right coset of N in G. Thus, 
fora € G, aN being a left coset, must be a right coset. Let aN = Nb, for some 
b eéG. Since a € aN, it follows that a € Nb. Hence, we get Na C NNb C Nb. 
This shows that Na = Nb, and soaN = Na. Consequently, aNa~! = N, and this 
implies that N is anormal subgroup of G. a 


Theorem 9.9 For each positive integer n, Ay is a normal subgroup of Sh. 


Proof Assume that w € S, is a product of k transpositions, say a = T, ... tT. Then, 
we have a~! = a ne a ae Now, if o € A,, then o is even, say a product of 2m 
transpositions. Consequently, waa! is a product of k + 2m + k = 2(k +m) trans- 
positions, and so aca! is even. | 


Theorem 9.10 SL, (F) is anormal subgroup of GLy(F). 
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Proof For every A € GL, (IF) and B € SL, (F), we have 
det(ABA~!) = det(A) det(B) det(A~!) = det(B) = 1. 


This shows that ABA~! € SL2(F), and so SL,(F) < GL, (EF). | 
Lemma 9.11 Let G be a group andN <G.IfN < H <G, thenN SH. 
Proof It is clear. | 


Theorem 9.12 Let H and N be subgroups of a group G. If N 1G, then HN = 
NH <G. 


Proof By Theorem 8.32, it is enough to show that HN = NH. Let x € HN be 
an arbitrary element. Then, there exist h € H and n € N such that x = hn. Since 
N <1G, it follows that hnh—! € N. Hence, we conclude that hn € Nh C NH, or 
equivalently, x ¢ NH. This shows that HN C NH. Analogously, we observe that 
NH C HN. Therefore, we get HN = NH. | 


Theorem 9.13 Let H and N be normal subgroups of a group G. Then, 


(1) HN is anormal subgroup of G; 
(2) HN is anormal subgroup of G. 


Proof (1) Leta € G and x €e HNN be arbitrary. Since H <1 G, it follows that 
axa~' € H. Since N<IG, it follows that axa~' € N. Consequently, we have 
axa'€ HON. 

(2) By Theorem9.12, HN is a subgroup of G. Now, let a € G and hn € HN be 
arbitrary. Then, we get ahna~! = (aha~')(ana~') € HN, using the normality of 
both H and N. a 


Theorem 9.14 Let G be a group, not necessarily finite, and let N be a subgroup of 
G such that the index [G : N] = 2. Then, N is anormal subgroup of G. 


Proof Since [G : N] = 2, it follows that N has two left cosets and two right cosets. 
One of them is always WN itself. Take a ¢ N. Then, aN is the other left coset, Na 
is the other right coset, and NUaN = N U Na = G. But these are disjoint unions, 
so aN = Na, and therefore aNa~! = N. This equation holds for any a in the coset 
aN. The equation clearly holds for any element of the coset NV. Hence, the equation 
holds for all elements of G, and we conclude that N is normal. | 


In the following example, we present three groups K, H and G such that K < H 
and H <JG, but K is not normal in G. 


Example 9.15 Let D4 = (R, S) be the dihedral group of order 8. Since [(R7, S) : 
(S)] = 2, it follows that (S) < (R?, S). Since [D4 : (R*, S)] = 2, it follows that 
(R?, S) <l D4. But (S) is not a normal subgroup of D4. 
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Theorem 9.16 /f p is the smallest prime number that divides the order of a finite 
group G, then every subgroup of G with the index p is a normal subgroup. 


Proof Let N be a subgroup of a finite group G of order n such that [G : N] = p. 
First we claim that if a ¢ N, then a ¢ N, for all | <i < p —1. Indeed, if this 
statement is not true, then there is 1 < k < p—1 such that a € N. Suppose that 
j is the least positive integer such that a/ € N, ie., forl <t< j—1,a' ¢ N. Let 
o(a) = m.Sincem|n,1 < j < pand pis the smallest prime factor of n, we conclude 
that j {m. Hence, there exist integers g andr such that m = qj + r and0 <r < j. 
Now, we can write 
e=a" =atit" = (al)‘a". 


This implies that x” = (al) *. Since a/ € N, it follows that a” € N, and it is a 
contradiction. Therefore, we proved that our claim holds. 

In order to prove the theorem, assume that N is not a normal subgroup of G. 
So, there exist a € G and x € N such that axa~! ¢ N. Obviously, a ¢ N, and so 
by the above discussion, we deduce that a' ¢ N, for every 1 <i < p—1. On the 
other hand, if a’ N = a/N, then there is y € N such that a‘ = a/y, or equivalently 


ai~J = y, and it is a contradiction. Consequently, N,aN,...,a?~'N are disjoint 
left cosets of N in G. On the other hand, assume that axa~! = b. Since b ¢ N, in 
a similar way, it follows that N,bN,..., b?-!N are disjoint left cosets of N in G, 


too. Thus, the following two families of left cosets 
{N, aN, ..., a?~'N}and{N, DN, ..., b?~'N} 


are equal. So, aN = b’N, for some 1 <r < p— 1. This means that a = bz, for 
some z € N. Now, we obtain 


a=bD'z= (axa~')'z = ax'a-'z., 


Hence, we have e = x"a~!z, or equivalently a = zx". Since x and z belong to N, 
it follows that a € N, a contradiction. Therefore, we conclude that N is a normal 
subgroup of G. a 


Theorem 9.17 Let G be a group and H and K be normal subgroups of G. If |H 
K| = 1, then hk = kh, forallh € Handk eK. 


Proof Suppose that h € H and k € K are arbitrary. We consider the element 
hkh-'k~'. On the one hand, since H <I G, it follows that h(kh~'k~') € H, and 
on the other hand, since K < G, it follows that (hkh—')k~!) € K. So, we obtain 
hkh'k-'! € HK = {e}. This implies that hkh~'k~! = e, or equivalently, hk = 
kh. | 
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Exercises 
1. Let G bea group in which, for some integern > 1, (ab)” = a"b" foralla, b € G. 


lon 


13. 


14. 


Show that the subset H = {x"~! | x € G} is anormal subgroup of G. 


. Find all normal subgroups of $3. 
. If N is anormal subgroup of G with |N| = 2, prove that N < Z(G). 
. Let N be a normal subgroup of $4. 


(a) If N contains a transposition, prove that N = Su4; 

(b) If N contains a cycle of length 3, prove that N = Aa; 
(c) If N contains a cycle of length 4, prove that N = Su; 
(d) Find all the normal subgroups of Sy. 


. Show that if a finite group G has exactly one subgroup JN of a given order, then 


N is anormal subgroup of G. 


. If Gis a group and N is a normal subgroup of G, show that Cg(H) 1G. 
. Prove that if N is a normal subgroup of the group G, then Z(N) is a normal 


subgroup of G. Show by an example that Z(V) need not be contained in Z(G). 


. Let H be a subgroup of order 2 in G. Show that Ng(H) = Cg(#). Deduce that 


if Nc(H) = G, then H < Z(G). 


. If A is an abelian group with A < G and B is any subgroup of G, prove that 


AN BSAB. 


. Show that if H and N are subgroups of a group G, and N is normal in G, then 


HN is normal in H. Show by an example that HM N need not be normal in 
G. 


. Let N be a normal subgroup of a finite group G. If N is cyclic, prove every 


subgroup of N is also normal in G. 


. Let G be the set of all triples of the form (a1, az, 1) or (a), a2, —1), where a, 


and a2 are integers. Define a binary operation on G by the rule 


(a1, a2, 1)(b1, bz, c) = (a) + bp, a + bp, €), 
(a1, a2, —1)(h, bo, c) = (ay + bo, a2 + by, —c), 


where c = +1. Prove that 


(a) G is a group; 

(b) H = (1,0, 1), (0, 1, 1)) is anormal subgroup of G; 
(c) K = (C1, 0, 1)) is a normal subgroup of H; 

(d) Is K anormal subgroup of H? 


If N is a normal subgroup such that [G : N] =n, show that x” € N, for all 
x EG. 

Let H bea proper subgroup of G such that for allx, ye G \ H, xy € H. Prove 
that H is anormal subgroup of G. 


222 9 Normal Subgroups and Factor Groups 


9.2 Factor Groups 


A factor group is a way of creating a group from another group. This new group 
often retains some of the properties of the original group. 


Theorem 9.18 Let G be a group and N be anormal subgroup of G. The set G/N = 
{aN | a € G} is a group under the binary operation (aN)(bN) = abN, for allaN 
and bN inG/N. 


Proof Our first task is to show that the operation is well defined. In order to do 
this, suppose that aN = cN and bN = dN, for some a, b, c and d in G. Then, we 
conclude c € aN andd € bN. Hence, there exist nj, n2 € N such that c = an, and 
d = bn». Now, we have 


(ab)"!cd = b“!a~!cd = b“!a7anbnz = b“'nybny. 


Since N <I G it follows that b-'n,b € N. Therefore, we deduce that (ab)~'cd € N. 
This forces that abN = cdN. 

Computing aN(bNcN) =aN(bc)N =a(bc)N, and _ similarly, we have 
(aNbN)cN = (ab)NcN = (ab)cN. So, associativity in G/N follows from asso- 
ciativity in G. SinceaNeN = aeN =aN = eaN = eNaN, itfollows thateN = N 
is the identity element in G/N. Finally, a~'NaN = (a~'a)N = N = (aa™'|)N = 
aNa“'N shows that a~!N is the inverse of aN. This proves that G/N is a group. ll 


The group G/N which was defined in Theorem 9.18 is called the factor group of 
G by N. 


Theorem 9.19 Jf G is a finite group and N 1G, then |G/N| = |G|/|N|. 
Proof The proof follows from Lagrange’s Theorem. | 


A factor group is also called the quotient group of G by N. Further, if the binary 
operation in G is addition, then each coset of N in G is denoted by a + N and the 
binary operation in G/N is also denoted additively, i.e., we write (a + N) + (b+ 
N)=(at+b)+N. 


Example 9.20 Let G = Zig and N = (6). Then, G/N ={0+N, 14+N, 2+ 
N, 3+N, 44+N, 5+N}. 


Example 9.21 Let U>5 be the group of units in Zs under multiplication modulo 25. 
We have |U25| = y(25) = 20. If N is the subgroup generated by 7, then N = (7) = 
{1, 7, 18, 24}. Since Uys is abelian, it follows that every subgroup including N is 
normal. Since [U25 : N] = |U25|/N = 5, it follows that there exist 5 left cosets of N 
in U25. We write them out 
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N ={l, 7, 18, 24}, 

ON = (2-11, 14, 23}, 
3N = (3, 4, 21, 22}, 
6N = {6, 8, 17, 19}, 
ON = {9, 13, 12, 16}. 


There are many ways to name these left cosets, because we can take any number of 
the left coset to stand in for the whole set. For example, 2N = 11N = 14N = 23N. 
Then, the Cayley table for U25/N is: 


H | H 2H 3H 6H 9H 
H | H 2H 3H 6H 9H 
2H| 2H 3H 6H 9H H 
3H|3H 6H 9H H 2H 
6H | 6H 9H H 2H 3H 
9H| 9H H 2H 3H 6H 


Example 9.22 Let G be a group such that (ab)? = a?b? for all a, b € G, where p 
is a prime number. Let 


N = {x € G| x?" =e for some m depending on x}. 


Then, N is anormal subgroup of G. If G = G/N and if X € G is such that x” = @, 
then x = e. 


Theorem 9.23 IfG isa group and N <1 G, then any subgroup of G/N is in the form 
of H/N such that NH <G. 


Proof Suppose that S is a subgroup of G/N. Define H = {x € G| xN € S}. First, 
we show that H is a subgroup of G. Since S < G/N, it follows that N € S. This 
means that e € H, and so H is non-empty. Now, leta,b € H. Then,aN,bN ES. 
Since S is a subgroup, it follows that aN(bN)~! = aNb-!N = ab™'N € S. This 
implies that ab~! € H, and hence H is a subgroup of G. So, by Lemma9.11, we 
conclude that V < H.Now, suppose that A «¢ H/N.Then, A = aN,forsomea € H, 
and so A=aN € S. Consequently, we obtain H/N CS. On the other hand, if 
AéS, then A=aN, for some a € H, and hence A € H/N. Therefore, we get 
S = H/N,inwhichN IH <G. | 


Theorem 9.24 Let G be a group and N 1G. Then, H/N G/N if and only if 
NSIHAG. 


Proof If N <H <G, then for every aN € G/N and hN € H/N we have a“! 
NhNaN = a"'ha € H,whichimplies that (aN)~'hNaN € H/N. This shows that 
H/N G/N. 

Conversely, if H/N G/N, then by Theorem9.23, we obtain NIH <G. 
Now, for every a € G and h € H, we have (aN) 'hNaN € H/N, or equiva- 
lentlya~'haN € H/N.Hence, there exists h’ €¢ H suchthata~'haN = h'N, which 
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implies that h’'a'ha EN. Since N is a subgroup of H, it follows 
that h’'a~'ha € H. Therefore, we conclude that a~'ha € H. This completes the 
proof. a 


Exercises 


. If G is an abelian group and N is a subgroup of G, show that G/N is abelian. 

. If Gis acyclic group and N is a subgroup of G, show that G/N is cyclic. 

. What is the order of the factor group Zeo/ (15)? 

. Let G be a group with G/Z(G) abelian, and let m € N be odd. Prove that 
G™ := {x | x © G} is anormal subgroup of G. 

5. Prove or disprove: If N is a normal subgroup of G such that N and G/N are 
abelian, then G is abelian. 

6. Prove or disprove: If N and G/N are cyclic, then G is cyclic. 

7. Suppose that N is a normal subgroup of a finite group G. If G/N has an element 
of order n, show that G has an element of order n. Show, by example, that the 
assumption that G is finite is necessary. 

8. Let N be a normal subgroup of a group G and let a belong to G. If the element 
aN has order 3 in the group G/N and |N| = 10, what are the possibilities for 
the order of a? 

9. Suppose that N is a normal subgroup of a group G. If |N| = 4 and aN has order 
3 in G/N, find a subgroup of order 12 in G. 

10. If N is a normal subgroup of a group G such that N and G/N are finitely 
generated, prove that so is G. 

11. Let UT2(F) be the group of invertible upper triangular matrices with entries in 

F, that is matrices of the form 


BRWN Re 


where x € F. 


(a) Show that N is an abelian subgroup of UT)(F); 
(b) Prove that N is normal in U7>(F); 

(c) Show that UT(F)/N is abelian; 

(d) Is UT,(F) normal in GL2(F). 
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12. Let G be a group and let K < H < Gwith K <G. Prove that 


“fe (4) _ No(H) 


K K 


9.3. Cauchy’s Theorem and Class Equation 


Cauchy’s Theorem gives some converse to Lagrange’s Theorem. 


Theorem 9.25 (Cauchy’s Theorem for Abelian Groups) Let G be a finite abelian 
group and p be a prime that divides the order of G. Then, G has an element of order 


Pp. 


Proof We establish this theorem by mathematical induction on |G]. In other words, 
we assume that the statement is true for all abelian groups having fewer elements 
than G, and we use this assumption to show that the statement holds for G as well. 
To start the induction we note that the statement is vacuously true for group having a 
single element. So, let |G| > 1. If G has no trivial subgroups, then G must be cyclic 
of prime order, and this prime must be p. Hence, there is a € G such that G = (a). 
This yields that |G| = o(a) = p, and we are finished. 

Now, we assume that G has a non-trivial subgroup NV. We consider two cases: 

Case 1: Let P| |N |. In this case, by our induction hypothesis, since N is abelian 
and |N| < |G|, it follows that there exists a non-identity element x «© N C G such 
that o(x) = p, and we are done. 

Case 2: Let p {|N|. Since G is abelian, it follows that N is a normal subgroup 
of G. Hence, we may construct the factor group G/N. Moreover, G/N is abelian. 
Since |G/N| = |G|/|N| and p { ||, it follows that p||G/N| < |G|. Hence, by our 
induction assumption, there is a left coset x N € G/N such that o(xN) = p. Then, 
we obtain (xN)? = x?N = N. This implies that x? € N and x ¢ N, consequently 

ae) = x?INl — e, Take y = x!%!, then y? = e. At the end, we must show that 
y #e. Indeed, if y = e, then x!¥! = e, and so (xN)!*! = N. On the other hand, 
since (xN)? = N and p{ |N|, we conclude that xN = N, or equivalently x € N, 
it is a contradiction. Consequently, y #4 e and y” = e. Therefore, y is the desired 
element of order p. | 


Theorem 9.26 If G is a finite abelian group and a positive integer m divides |G|, 
then G contains a subgroup of order m. 


Proof We proceed by mathematical induction over |G|. If |G| = 1, then m must be 
1. In this case G is its own subgroup of order m. To apply induction, let |G| > 1 
and assume that the statement is true for every abelian group of order less than 
|G|. We can suppose that m > 1, because for m = 1, {e} is the subgroup of G of 
order |. Let p be a prime number such that p|m. We conclude that P| |G|. Then, by 
Cauchy’s Theorem, there is a € G such that o(a) = p. Let N = (a) be the cyclic 
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group generated by a. Then, we have |N| = p. Now, G/N is an abelian group 
such that |G/N| = |G|/|N| < |G|. Since p|m, it follows that m = kp, for some 
positive integer k. Moreover, we have k| |G/N|. Hence, by our induction assumption 
and Theorem 9.23, G/N has a subgroup H/N of order k, where A is a subgroup 
of G containing N. Consequently, we obtain |H| = |H/N|-|N| =kp =m. This 
completes the proof. a 


Let a, b € G. We recall that b is a conjugate of a in G if b = xax™', for some 
x €G, and in this case, we write a ~Conj b. In Theorem3.53, we showed that 
the conjugacy relation is an equivalence relation. For each a € G, let C(a) = {b € 
G | a ~conj 5}, the equivalence class of a € G under ~¢,,; relation, and it is usually 
called the conjugate class of a € G. 

Our attention now narrow to the case in which G is a finite group. Let G be a 
finite group, and suppose that C(a;), C(az2),..., C(am) are the totally of all con- 
jugate classes of G. For each a € G, let cg = |C(a)|. Since conjugacy classes are 
disjoint and their union is G, we obtain 


m 
IG) = Dea: 
i=1 


Theorem 9.27 If G is a finite group, then 


1G 
Ca ae ey ener | 
ICo(a)| 


in other words, the number of elements conjugate to a in G is the index of the 
centralizer of a in G. 


Proof Let Cg(a) has k distinct left cosets, say x;Cg(a), x2CgG(a),...,x~xCg(a). 
Then, we know that k = [G : Cg(a)]. We claim that for every i 4 j, apaxe and 
i eh ‘ . -1_ ~1 ; ; 
Xjax; are distinct conjugate of a. Indeed, if xjax,; = Xjax;, forsomei 4 j, then 

we conclude that 


ae ae 2 =a> Gy apaGe =a> (x7! xia = a(x; 'x;) 
= x71x; € Ce(a) > xiCo(@) = xjCe(@) > i = j. 


Now, we show that KiGX, Ome se are all distinct conjugate of a. Let b = 
xax', for some x € G. Since 


k 
G= U x;Cg(a), 
i=l 


it follows that x = x;y, for some y € Cg(a) and positive integer i. Then, we can 
write 
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-1 _ fy. ee ee has sd 
xax~* = (xjy)a(xjy)~ = xjyay x, = xjax; . 


Consequently, any conjugate b of a is equal to one of the xjax; ' Therefore, a has 
exactly k conjugate. This yields that C(a) contains exactly k elements. a 
Corollary 9.28 If G is a finite group, then 
c= 
— |Co(a)| 
where this sum runs over one element a in each conjugate class. 
Proof Since |G| = >> ca, using Theorem 9.27, the result immediately follows. Il 
Lemma 9.29 Let G be a group. Then, a € Z(G) if and only if Cg(a) = G. 
Proof It is straightforward. | 
In Lemma 9.29, if G is finite, then |Cg(a)| = |G| is equivalent to Cg(a) = G. 


Corollary 9.30 Let G be a finite group, then 


|G| 


IG) =|Z(G)|+ >> THEN 


a 


where the sum runs over elements a, taken one from each of those distinct conjugate 
classes which contains more than one element. 


Proof By Lemma9.29, a € Z(G) if and only if cg = 1. Since there are |Z(G)| 
number of conjugate classes each having only one element, the corollary follows. 


The equation in Corollary 9.30 is usually referred to as the class equation of G. 


Theorem 9.31 (Cauchy’s Theorem) Let G be a finite group and P| |G|. Then, G has 
an element of order p. 


Proof We do the proof by mathematical induction. The statement is vacuously true 
for groups of order 1. We assume the statement holds for all groups having fewer 
elements than G, and then we prove the statement is true for G. We consider the 
following two cases: 

Case 1: There exists a proper subgroup H of G such that P| |H|. Then, by our 
induction assumption there exists an element a of order p in H, and so it is also in 
G. 

Case 2: p is not divisor of the order of any proper subgroup of G. If G 4 Z(G), 
then there exists a ¢ Z(G). Hence, we obtain Cg (a) 4 G. Since Cg (a) is a subgroup 
of G, by our assumption, we conclude that p { |Cg(a)|. We write down the class 
equation: 
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|G| 
ICe(@| 


IG|=IZ@i+ > 


Ce (4G 
Since P\IG| and p { |Cg(a)|, it follows that 


; |G| 
ICg(a)| 
Thus, we obtain 
|G| 
IC (a)| 


P 
Co@4G 


Since also we have p | |G|, it follows that 


P|(IGi - > wo) 


Co(a)4#G 


and consequently, P| |Z(G)|. But we assumed that p is not divisor of the order of 
any proper subgroup of G, so we conclude that Z(G) can not be a proper subgroup 
of G. This means that Z(G) = G, i.e., G is abelian. Now, by Cauchy Theorem for 
abelian groups, the result follows. | 


Exercises 


1. In this exercise, we obtain a simple proof of Cauchy’s Theorem. Let G be a finite 
group of order n and p be a prime number such that p|n. Consider the set 


S = {(Q, a2,...,4p) | a; € Gand aja2... dy = e}. 


(a) Compute |S], the number of elements in S; 

(b) Define a relation p on S by saying two p-tuples are related if one is a cycle 
permutation of the other. For example, (a), a2, ..., dp) (2, 43, ...,@p, a1). 
Show that p is an equivalence relation. 

(c) If all component of a p-tuple are equal, show that its equivalence class con- 
tains only one element; otherwise, if two components of a p-tuple are distinct, 
there are p elements in the equivalence class; 

(d) Let r denote the number of solutions to the equation x” = e. Then, r equals 
the number of equivalence classes with only one element. Let s denote the 
number of equivalence classes with p elements. Show that r + sp = n?7!; 
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(e) Deduce Cauchy’s Theorem from the above, namely, G has kp solutions to 
the equation x”? = e. 


2. Ifall non-identity elements of a group G have the same order, show that this order 
is a prime p and |G| is a power of p. 

3. Prove that any normal subgroup of order 2p, where p is a prime number, has a 
normal subgroup of order p. 

4. If G is an abelian group of order p; pz... pg, where p), P2,..., Pe are distinct 
primes, prove that G is cyclic. 


9.4 Worked-Out Problems 


Problem 9.32 Let H be a cyclic subgroup of a group G and let HIG. If Nisa 
subgroup of H, prove that N IG. 


Solution Suppose that H = (x) is anormal subgroup of G and leta ¢€ G is arbitrary. 
Since H is normal, it follows that axa—! € H. Hence, we have axa~! = x*, for some 
integer k. If N is a subgroup of H, then AN is cyclic and generated by x”, for some 
integer n. Now, we obtain ax"a~! = (axa7!)" = x" = (x")* € N. This completes 


the proof. | 
Problem 9.33 Let N be a normal subgroup of a finite group G. If |N| and[G : N] 
are relatively prime, prove that any element a € G satisfying a!%! = e must belong 
to N. 


Solution Suppose thata € G such that a!%! = e. Since ([G : N], |N|) = 1, it follows 
that there exist integers m and n such that m[G : N]+n|N| = 1. Then, we deduce 
that 

a= qmlG:N]tn|N| = GMa N]gnlN| = ql] (alN1)" = qe]. 
Now, we consider aN as an element of G/N. Then, we have(aN)!0/"! = al@/NIN = 
N, the identity element of G/N. This yields that a!°/"! © N, and consequently, 
a= (alG:NI)" EN. | 


Problem 9.34 Let G bea group and Z(G) be the center of G. If G/Z(G) is cyclic, 
prove that G is abelian. 


Solution To start, every elements in the factor group G/Z(G) is a left coset aZ(G) 
with a € G. Since G/Z(G) is cyclic, it follows that G/Z(G) = (xZ(G)), for some 
x € G. Now, let a and b be elements of G so that aZ(G) and bZ(G) belong to 
G/Z(G). It follows that there exist integers i and j such that aZ(G) = (xZ(G))' = 
x'Z(G) and bZ(G) = (xZ(G))’ = x/Z(G). Hence, a = x'z, and b = x/zp, for 
some Z1, Z2 € Z(G). Then, by the definition of center and laws of exponents, we 
have 
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ab = (x'21) (0422) = x! x) 2122 = x4 x!29z1 = (4/22) (x'z1) = ba. 
Now, we conclude that G is abelian. |_| 


Problem 9.35 If G is a group of order p”, where p is a prime number, prove that 


Z(G) F {e}. 


Solution Suppose that |Z(G)| = m. By the class equation, we have 


7 IG 
eee owl Co(@)l’ 


where sum runs over element a, taken one from each of conjugate class which has 
more than one element. Now, foreacha ¢ Z((G), we have |Cg(a)| | |G|. This implies 
that |Cg(a)| = p, for some integer | < kg <n. Hence, we obtain 


= -— 
P\\P pke a : 
a 


Since e € Z(G), it follows that m is non-zero. Hence, m is a positive integer 
divisible by the prime number p, and so we conclude that m > 1. This shows that 
Z(G) F {e}. a 


Problem 9.36 Let G be a group of order p”, where p is a prime number. Prove that 
every subgroup of order p”~! of G is normal. 


Solution Suppose that H is a subgroup of G and |H| = p”~!. We establish the proof 
by mathematical induction. If nm = 1, then |G| = p and H = {e} <1 G. Suppose that 
n > | and the statement is true for every group of order p”, where 1 < m <n. We 
know that H < Ng(#). We consider the following two cases: 

Case 1: Nc(A) # H. In this case, |Ng(H)| = p”, and so NG(A) = G. This 
means that H <G. 

Case 2: NG(A) = H. Since Z(G) C Ng(A), it follows that Z(G) C H. Since 
p\|Z (G)|, by Cauchy’s Theorem it follows that there exists a € Z(G) such that 
o(a) = p.Take K = (a), then K is anormal subgroup of order p in G. Now, we have 
|H/K| = p"~*, so by our induction assumption, we conclude that H/K < G/K, 
because |G/K| = p"~!. Therefore, H is a normal subgroup of G. a 


Problem 9.37 If G is a group of order p”, prove that G is abelian. 


Solution As a group G is abelian if and only if Z(G) = G, our aim is to show 
that Z(G) = G. Since |G| = p*, by Lagrange’s Theorem, |Z(G)| |p”. This implies 
that |Z(G)| = 1, p or p?. By Problem9.35, we conclude that |Z(G)| 4 1. Assume 
that |Z(G)| = p. Take a € G such that a ¢ Z(G). Then, Cg(a) is a subgroup of 
G and Z(G) is a subset of Cg(a). Since a ¢ Z(G) and a € Cg(a), it follows that 
Z(G) # Cg(a). Hence, we have p = |Z(G)| < |Cg(a)|. Then, we conclude that 
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|ICg(a)| = Pp or Cg(a) = G. This means that all of elements of G commute with 
a, and soa € Z(G), a contradiction. Thus, |Z(G)| = p is not an actual possibility. 
Therefore, |Z(G)| = Pp’, and so G = Z(G). a 


9.5 Supplementary Exercises 


1. (a) Let N be a normal subgroup of a group G, with G/N abelian. Does it fol- 
low that G = HN for some abelian subgroup H of G? Give a proof or a 
counterexample. 

(b) Answer the same question, with both occurrences of “abelian” replaced by 
“cyclic”. 
2. Suppose that K < G with |K| = m. Let x € G and n be a positive integer such 
that (m,n) = 1. Prove that 


(a) If o(x) =n, then o(xK) =n; 
(b) If o(xK) =n, then there is an element y € G such that o(y) =n andxK = 
yK. 


3. Let N be a normal subgroup of a group G. If x, y € G such that xy € N, show 
that yx € N. 

4. Let H be a subgroup of index 2 in a finite group G. If the order of H is odd and 
every element of G \ A is of order 2, prove that H is abelian. 

5. Suppose that G is a finite abelian group such that each element of G has order 
1 or 3. If A is a subgroup of G, show that each element of G/H has order 1| or 
3. Use induction on |G| to show that |G| = 3“, for some positive integer k. 

6. For which values of n is it true that the dihedral group D,, has a pair of proper 
normal subgroups H and K for which D, = HK and H 1 K is singleton. 

7. If K < H < Gand WN <1G, show that the equations HN = KN and HON = 
KN imply that H = K. 

8. Let G be a finite group, H be a subgroup of G, and N I G.If|H| and[G: N] 
are relatively prime, prove that H is a subgroup of N. 

9. Let G be a finite group and N (1G. If |N| and [G: N] are relatively prime, 
prove that N is the unique subgroup of G of order N. 

10. Let G be a group and H be the intersection of all subgroups of G that have finite 
index in G. Show that H is normal in G. 

11. Let G be a group and H be a subgroup of G with finite index. Show that there 
is anormal subgroup N of G for which N < H < G where N also has finite 
index in G. 

12. Prove that the torsion subgroup T of an abelian group G is a normal subgroup 
of G, and that G/T is torsion free. 

13. Let N be a normal subgroup of G of index n. Show that if a € G, thena” € N. 
Give an example to show that this may be false when N is not normal. 

14. If|G| = pq, where p and gq are not necessarily distinct primes, prove |Z(G)| = 1 
or G is abelian. 
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15. 


16. 
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If H and K are subgroups of finite index in a group G, and[G : H] and[G: K] 
are relatively prime, prove thatG = HK. 

If G is a non-abelian group of order p*, where p is a prime number, show that 
|Z(G)| = p. 

Hint: Use Problem 9.34. 


. Let G be a finite group and p be the smallest prime number such that P| |G|. If 


N <G and |N| = p, prove that N < Z(G). 


. If H is asubgroup of finite index in a group G, prove that H contains a subgroup 


N which is of finite index and normal in G. 


. The set X is called a normal subset of G if G is the normalizer of X. Let X bea 


finite normal subset of a group G such that for some positive integer n, x” = e, 
for all x € X. Prove that every element of the group H = (X) may be written as 
a product of not more than (7 — 1)|X| elements of X. In particular, H is a finite 


group. 


Chapter 10 M®) 
Some Special Subgroups cies 


In this section, we introduce the notion of commutator subgroup or derived subgroup 
of a group. This subgroup is important because it is the smallest normal subgroup 
such that the quotient group of the original group by this subgroup is abelian. Also, 
we study other special subgroup of a group called the maximum subgroup. 


10.1 Commutators and Derived Subgroups 


Let G bea group, and let a, b € G. The commutator of a and bis [a, b] = a~'b7'ab. 
“Commutator” is a good word to use: because [a, b] is a kind of measure as to how 

near a to b come to commuting, [a, b] being the identity element of G if and only if 

ab = ba. For any a, b,c € G, we denote [a, b, c] = [[a, b], c]] and b* = a~!ba. 


Theorem 10.1 Let G be a group. For every a, b,c € G, the following hold: 
(2) (aby) = lane b,c) 

(2) [a, bc] = [a, c]la, bl’; 

(3) (a,b,c? [b,c af le, a7} br = 1, 


Proof (1) We can write 


[ab, c] = (ab)“!c~!abe = b“!a7!c7!abe 
= b-!(a7!c~!ac)b(b-!c!be) = [a, c]’[b, c]. 


(2) We have 


[a, bc] a~!(bce)~!abe = a~'c7!b~ abe 
= (a“!c"!ac)c"!(a~!b“ab)c = [a, c][a, by°. 
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(3) We can write 


[a,b"!,c?? = b“'T[a, b-'), c|b = b~'[a'bab™!, cb 


= b-!(ba“|b~!ac“a'bab-!c)b = (a~!b-!ac7!a7!)(bab7!cb) 
= (aca~'ba)~| (bab! cb). 


Hence, we have 


fa, b~!, cl’ = (acaba)! (bab™' cb). (10.1) 
Similarly, we obtain 

[b, c7!, al’ = (bab™'cb)“! (che! ac) (10.2) 
and 

[c,a7!, BJ” = (cbc7!ac)"'(aca™'ba). (10.3) 
Now, by (10.1)—(10.3), the results follows. | 


It is perfectly natural to look at the set of all commutators in a group G. This 
subset may not form a subgroup of G, so we move to the next best thing. 


Definition 10.2 Let G be a group. The subgroup generated by the set {[a, b] |a,b € 
G} is called the commutator subgroup or derived subgroup of G. It is denoted by G’ 
or G or [G, G]. 


A group G is called perfect if G = G’. 
Theorem 10.3 /f G is a group, then G’ SG and G/G' is abelian. 


Proof For every a, b, g € G, we have 


[a’, b’] = [g~'ag, g-'bg] = (g-'ag)'(g~'bg)'(g7!ag)(g7'bg) 
= (g-!a~'g)(g~'b-'g)(g~'ag)(g7 bg) 
= g-'(a"'b-!ab)g = g“'[a, blg = [a, b]*. 


So, we obtain [a, b]’ = [a’, b]. Now let x € G’ be an arbitrary element. Then, we 


can write x = [][a;, b;], where a;, bj € G for all 1 <i <n. Therefore, we have 
i=l 


n 


gixg = (T[ta. bil) =[ [lai bil = Tlte’. bf] eG’. 
i=1 


i=l i=l 


This shows that G’ 1G. 

Now, let aG’ and bG’ be arbitrary elements of G/G’. Since a~'b7!ab € G’, 
it follows that abG’ = (ab) "@, or equivalently aG’bG’ = bG’aG’, and so 
G/G’ is abelian. | 
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Theorem 10.4 Let G be a group and N be anormal subgroup of G. Then, G/N is 
abelian if and only if G' < N. 


Proof Suppose that G/N is abelian. Then, for every a,b € G, we have aNDN = 
bNaN. This implies that abN = baN, or equivalently a~'b-'ab € N. Since N 
contains every commutator [a, b], it follows that G’ < N. 

Conversely, let G’ < N, and assume that aN and DN are arbitrary elements of 
G/N. Since a~'b~'ab ¢€ G’, it follows that a~'b-'ab € N. Hence, we have abN = 
baN. This means thataNbN = bNaN, and so we conclude that G/N is abelian. 


Definition 10.5 A group G is called metabelian if there exists a normal subgroup 
N of G such that both N and G/N are abelian. 


Theorem 10.6 A group G is metabelian if and only if G’ = {e}. 


Proof Let G be a metabelian group; we show that G” = {e}. Since G is metabelian, 
it has a normal abelian subgroup N, and G/N is abelian. Thus, by Theorem 10.4, 
G’' < N. Since N is abelian, it follows that G’ is abelian, and so G” = {e}. 
Conversely, suppose that G” = {e}. We show that G is metabelian. It is clear 
that if G is abelian, then G is metabelian. So, we have only to consider the case 
where G is not abelian, and hence G’ & {e}. Thus, assume that G is not abelian. By 
Theorem 10.3, we know that G’ is a normal subgroup of G. Let a, b € G’. We know 
that aba~'b~' = e since G” = {e} and so we have that ab = ba. Thus, arbitrary 
elements a, b € G’ commute, and so G’ # {e} is a normal abelian subgroup of G. 
Again, by Theorem 10.3, G/G’ is abelian, and so G is metabelian. a 


Theorem 10.7 Let G be a group such that G' is a subset of the center of G. Then, 
[a, bc] = [a, b][a, c], for alla,b,c € G. 


Proof We can write 


[a, be] = a7! (be)~!abe = a“!c"(ac)(c7 a7!) babe 
= (a~!c~!ac)c!(a“!bab)c = [a, c]c™![a, b]c. 


Since [a, b] € G’ andG’ C Z(G), it follows that [a, b] € Z(G). So, we conclude that 
[a, b]c = c[a, b]. Consequently, we get [a, bc] = [a, c][a, b] = [a, b][a, c]. This 
completes the proof. a 


More generally, for subgroups H and K of a group G, we denote [H, K] the 
subgroup of G generated by the set {[h, k]|h € H andk € K}. 


Theorem 10.8 Jf G is a group and H, K are subgroups of G, then 


(1) [H, K]=[K, A; 
(2) [H, K]S HV K. 
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Proof (1) Let [h, k] be an arbitrary commutator in [H, K]. Then, we have 


[h, k] = Ao hk = (ka kh)* = [k, hyo 


implies that [h,k K H]. So, we have [H, K] C [K, H]. In a similar way, we 
obtain [K, H] C K]. Therefore, we conclude that [H, K] =[K, H]. 

(2) Since H v K is the subgroup generated by H U K, it follows that [h, k] € 
HY K, forallh € H andk € K. This shows that [H, K] is asubgroup of H v K. 
Now, let [h, k] be an arbitrary commutator in [H, K] and x € H. Then, by Theo- 
rem 10.1 (1), we have [hx, k] = [h, k'[x, k], andso[h, k]* = [hx, k][x, k]~!. Since 
[hx, k] and [x,k]~! belong to [H, K], it follows that [h, k]}* € [H, K]. Also, let 
y € K, then by Theorem 10.1 (2), we can write [h, ky] = [h, y][h, k]’, and hence 
[h, k]? = [h, 'Th, ky]. Since [h, y]~! and [h, ky] lies in [H, K’], it follows that 
[h, k]’ € [H, K]. Therefore, we conclude that[H, K]S Hv K. | 


Since [K, H] < G and [k,h] € [K, H],, it follows that [k, h]-! € [K, H]. This 
Je 
[H 


Theorem 10.9 [fG is a group and H, K are normal subgroups of G, then[|H, K] < 
AOK. 


Proof Suppose that h € H and k € K are arbitrary. Since H <I G, it follows that 
h-'(k-'hk) € H. Similarly, since K <G, it follows that (h~'k~'h)k € K. Thus, 
we obtain [h, k] © HM K. Consequently, we have [H, K] < HONK. | 


Theorem 10.10 Jf H, K, and N are normal subgroups of a group G, then 
[HK, N]=[H, NI[K, N]. 


Proof Assume that a € H,b € K andc € N. Then, by Theorem 10.1 (1), we can 
write 
[ab,¢) = [2,5 [b, = [a,c 18 el. (10.4) 


Since H and WN are normal subgroups of G, it follows that [a’, c?] € [H, N]. On the 
other hand, we have [b, c] € [K, N]. Now, by (10.4), we conclude that [H K, N] € 
[H, N\[K, N]. 
Conversely, since H and K are subgroups of H K, it follows that 
[H,N| C[HK, Nj and[K,N] C[HK, N]. 


Consequently, we get [H, N][K, N] C [HK, N], and this completes the proof. 
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Exercises 


= 


. Let G be a group and H be acyclic subgroup of G. If G’ < H, show that H IG. 
2. For any group G, show that G’ is the subset of all “long commutators”: 


G' = {ayay...a,a;'ay'...a;! |a; € Gandn > 2}. 


Hint: (aba|b-!)(ede~!d7!) = a(ba~!)b“e(de“!)d7a7" (ab! )bc7! 
(cd7!)d. 
3. Let G be a group and suppose that H and K are subgroups of G. Prove that 


(1) H Gif and only if [H, G] < A; 
(2) If K isasubgroup of H and K <G, then[H, G] < K ifandonlyif H/K < 
Z(G/RK). 


4. Let G = HK, where H and K are abelian subgroups of G. Prove that G’ is 
abelian. 

5. Let N be a normal subgroup of a group G such that NM G’ = {e}. Show that 
N < Z(G). 

6. If H is a subgroup of a metabelian group G, prove that H is metabelian. 


10.2 Derived Subgroups of Some Special Groups 


In this section, we investigate the derived subgroups of symmetric groups, alternating 
groups, quaternion group, general linear groups, special linear groups, and dihedral 
groups. 


Lemma 10.11 Letn > 5 and N <1 A,. If N contains a cycle of length 3, then N = 
Ay. 


Proof Suppose that (x y z) € N. By Theorem5.43, each permutation in A, is a 
product of cycles of length 3. Hence, it suffices to prove that every cycle of length 
3 lies in N. Assume that (a b c) € S, is an arbitrary cycle. Take o € S, such that 
xo =a, yo = band zo =c. Then, we have a l(x yzjo=(abec). 

Ifo € A,, then (a bc) € N, because N < Ay. 

If o € Ay, then o is an odd permutation. Since n > 5, we can consider r and 
s distinct from x, y, and z. So, we have (r s)o € A,. Consequently, we have 
((r so) (x y z)\(r s)o € N, or equivalently o~!(r s)(x y z)(r s)o € N. Since 
(rs) and (x y z) are disjoint cycles, it follows that o~'!(x y z)o € N. This shows 
that (abc) EN. | 


Theorem 10.12 For all n > 2, S) = Ay. 
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Proof Let n > 2 and suppose that o and 7 are two arbitrary permutations in S,,. Let 
0 =0102...0, and T = 7172...7;, where o;’s and 7;’s are transpositions. Then, we 
have 


[o, T] = o'r tors Or ...O0201Ts ...72T10102 ...O07T1T2...Ts5- 


Hence, [o, 7] is a product of 2(r + s) transpositions, and so it is an even permutation. 
This shows that [o, 7] € A,. Therefore, we conclude that S’ < Ay. 

Now, in order to show that A, = S/, we consider the following cases: 

Case I: n = 2. In this case, clearly, we have S’ = Az = {id}. 

Case 2:n = 3. Wehave A3 = {id, (123), (13 2)}. Since there are permutations 
in $3 that don’t commute, it follows that the derived subgroup is not just {id}. Hence, 
it must be all of A3. 


Case 3: n = 4. For distinct i, j,k € {1, 2, 3, 4}, we have 
(é f), GH = GC NEHENGH =C jk’ = Ck j), 


that is, every cycle of length 3 belongs to derived subgroup. Therefore, we conclude 
that Si, = Ag. 

Case 4: n > 5. We know that S’, is a normal subgroup of S,,. Moreover, we proved 
that S’ < A,. On the other hand, we have 


[1 2), (23)] = 0 2)(2 3) 2)223) = (123) ES’. 


This means that S$’ contains a cycle of length 3, and so by Lemma 10.11, we deduce 
that S’ = Ap. a 
Theorem 10.13 We have 

(1) AS = Ay = {id}; 

(2) A = {id, (1 2)(34), (1 3)(24), (1 4)2 3)}; 

(3) A’, = Ay, for alln > S. 


Proof (1) It is straightforward. 

(2) Assume that N = {id, (1 2)(3 4), (1 4)(23), (1 3)(2 4)}. It can be checked 
that NV is anormal subgroup of Aq. Since | A4/N| = 3, it follows that A4/N is abelian, 
and so by Theorem 10.4, we have Ai, < N. Since Ay, is non-abelian, it follows that 
Al, 4 {id}. Hence, there exists a non-identity permutation (a b)(c d) in Aj. Since 
Al, <l Aq and every product of disjoint transpositions are conjugate in Ay, we conclude 
that A, = NV. 

(3) Since A’, J A,, by Lemma 10.11, it is enough to prove that A’, contains a 


cycle of length 3. Since n > 5, assume that o = (a c b) and tT = (b c)(d e). Then, 


we have [rt o] = Tt !a7!ra = (77 !a7'7)a. Also, we have 


tT to!+ =77!(abe)t = (at bt cr) = (ach) =0. 


Consequently, we obtain [7, 7] = o* = (a bc) as desired. | 
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Theorem 10.14 Jf Qs is the quaternion group, then Q. = {1, 1}. 


Proof We know that Qg = {1, —1, J, —I, J, —J, K, —K}. The commutator 
subgroup contains the element 


U, =F = (-1I)(-/JIJ = CT NU/) = K? = -1. 


Similarly, [J, K] = —1 and K, J] = —1. On the other hand, | and —1 commute 
with all elements of Qg, hence [a, —1] = [a, 1] = 1, for all a € Qg. Therefore, 
the commutator subgroup is the subgroup of Qg generated by —1 and 1, which is 
Oy = {1, I}. | 


Theorem 10.15 Let F be a field and n > 2 be an integer. Then, 
SL, (FY =GL, (F)’ = SL,(F), 


except when n = 2 and F consists of 2 or 3 elements. Thus, SL,(F) is perfect (with 
the exceptions listed). 


Proof Let A and B are two arbitrary matrices in GL, (F). Then, we have A~! B~! 
AB € GL, (F)’. Since 


det(A~!B-! AB) = det(A~!) det(B™') det(A) det(B) = 1, 


it follows that A~'B-'AB € SL, (EF). This yields that GL,(F)’ C SL,(F). On 
the other hand, since SL,(F) C GL, (EF), it follows that SL,(F)’ C GL,(F)’. So, 
we have SL,(F)’ C GL, (F)’ C SL, (F). Consequently, it is enough to show that 
SL,(F)' = SL, (EF). To establish this equality, by Theorem 7.41, it is enough to show 
that each matrix of the form F;;(A) with i # j and X € F*, is a product of commu- 
tators. It is easy to check that for any distinct indices i, j, k, 


LEix(A), Fxg Q)] = Bix(—A) Ex (-—D Bi A) Ex) = Ei (A). 


his expresses each £;;(A) as a commutator when n > 3. For n = 2, we have 
a? 0) [4-b]]_-fa? 07 [1 4) Te of 1 2 
0 al’|}O1{| | Oa 01 0 a 01 
_fa o]'f1 -b] "fa o]f1 ob 
~| 0 a7! 0 1 0 a 01 
_ [1 bd -a’) 
=| 9 I . 


So, if F contains an element a such that a 4 0 and a” ¥ 1, then we can express any 
matrix E}2(A) as acommutator by taking b = (1 — a?)~!), and similarly for E (A). 
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This yields that SL,,(F)’ = SL,(F) except when n = 2 and a> =a, for allae F, 
and this happens only when |F| = 2 or 3. a 


Theorem 10.16 Jf D,, is the dihedral group of order 2n, then by Corollary 7.74, we 
have D, = (R, S), where S is a reflection and R is a rotation. Then, D4, = (R?). 


Proof Since [S, R] is S~'R~'SR = SSRR = R’, it follows that R? is a commuta- 
tor. Moreover, since [S, R'] = S~!R~'SR' = SSR'R' = R*, it follows that every 
element of (R?) is a commutator. Now, we prove that every commutator belong to 
(R?). Suppose that A and B are two arbitrary elements of D2,. We consider the 
following cases: 

Case 1: Both A and B are rotations. In this case, we can write A = R' and B = R/, 
for some integers i and j. Since R’ and R/ commute, it follows that [A, B] = I. 

Case 2: A is arotation and B is areflection. Then, we have A = R‘ and B = R/S, 
for some integers i and j. Since B = B™!, it follows that 


[A, B] = A-'B-'!AB = A~'!BAB = R7'R/SR'R!S = RI-'SR'tS. 


By Theorem 7.75, since RS = SR, it follows that [A, B] = R~*! € (R?). 
Case 3: A is a reflection and B is a rotation. In this case, we have [A, B]~! = 
B-'A—'BA. By the case (2), [A, B]~! € (R?), and this implies that [A, B] € (R?). 
Case 4: Both A and B are reflections. Then, we have A = R'S and B = R/S. 
Hence, we obtain A = A~! and B = B™'. So, we conclude that 


[A, B] = A~-'B-!AB = ABAB = (AB)* = (R'SR/S)* 
= (R'R/SS) = RS) € (R?). 


This completes the proof. | 


Exercises 


1. Compute [GL,,(F) : SL,(F)], where F is finite. 

2. Suppose that O, (F) = {A € GL, (F) | A'A = I} and SO,(F) = {A € O,(F) | 
det(A) = 1}. Prove that $O,(F) <1 O,(F) and compute [O,(F) : SO, (F)]. 

3. Determine the derived subgroup of SO, (F). 


10.3. Maximal Subgroups 


A maximal subgroup of a given group G is a proper subgroup of G, that is, a subgroup 
which is neither the identity nor G itself, and in order to be maximal, it can not be 
contained in a larger proper subgroup of G. More precisely: 
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Definition 10.17 A subgroup M of a group G is called a maximal subgroup if 
M & G and the only subgroups containing M are M and G. 


Example 10.18 We know that the subgroups of Z are of the form nZ. Note that 
nZ C mZ if and only if m|n, and so the maximal subgroups of Z are precisely pZ 
for p prime. 


Example 10.19 In each Hasse diagram, the lower subgroup is a maximal subgroup 
in the upper one. 


Theorem 10.20 /f H is aproper subgroup of a finite group G, then there is amaximal 
subgroup of G containing H. 


Proof If H is a maximal subgroup of G, then we have nothing to do. If H is not a 
maximal subgroup, we pick a proper subgroup H; of G such that H C Hj. Again, if 
H, is maximal in G,then we are done; otherwise, we pick a proper subgroup H of 
G such that H C H; C Hp. Continuing in this way, we construct a chain of proper 
subgroups of G: 

HcHCHhc..., 


and this gives a strictly increasing chain of integers 
|H| <|Hi| < |A| <..., 


which is bounded above by |G|, and so must terminate. Consequently, we obtain a 
maximal subgroup containing H. | 


We conclude that in non-trivial finite groups, maximal subgroups will always 
exist. However, not all groups will have maximal subgroups. 


Example 10.21 The rational numbers Q under addition have no maximal subgroups. 
To see the reason, suppose that M is a maximal subgroup of Q. There exists a 
rational number r/s (r,s € Z and s #0) such that r/s ¢ M. Since Z is a proper 
subgroup of Q, it follows that M 4 {0}. Let m/n € M with m,n € Z and mn 0. 
Since m/n € M, it follows that nm/n € M, and som ¢€ M. Since M is a maximal 
subgroup, it follows that 

M + (r/s) =Q. (10.5) 


Considerr/sms € Q. Then, by (10.5), there existh € M andt € Zsuchthatr/sms = 
h-+tr/s.Hence, wehaver/s = msh + tmr = m(sh + tr).Sincem € M, itfollows 
that r/s € M. This is a contradiction. 


Theorem 10.22 The center of a group G is properly contained in every maximal 
subgroup of G having a composite index. 


Proof Let M bea maximal subgroup of G with composite index. Suppose that Z(G) 
is not contained in M. Then, there exists a € Z(G) such that a ¢ M. We consider 


242 10 Some Special Subgroups 


(M, a), the subgroup generated by M U {a}. Since M is a maximal subgroup and 
M c (M, a), it follows that (M, a) = G. Assume that x € G is an arbitrary element. 
Since a € Z(G), it follows that x = a‘b, for some b € M andk € Z. Now, for every 
y € M, we have 


xyx7! = (a*b)y(a‘b)—! = akbyb-!a~ = byb™', 


because a € Z(G). Since b, y € M, it follows that byb7! e€ M. This means that 
xyx~! € M, and so M is anormal subgroup of G. Since G/M has composite order, 
we conclude that G/M has a non-trivial subgroup, say H/M.Hence,{M} C H/M C 
G/M. This yields that M Cc H C G, and this is a contradiction with the maximality 
of M. 

Note that if Z(G) = M, then M is a normal subgroup of G and by the same 
arguments, we obtain a contradiction. | 


Lemma 10.23 [fan element of a group is not contained in any maximal subgroup, 
then the group is cyclic. 


Proof Suppose that G is a group and x € G with x is not in any maximal subgroup 
of G. Since x is not in any maximal subgroup, it follows that (x) is not maximal. 
Moreover, since x is not in any maximal subgroup, it follows that (x) is not a subset 
of any maximal subgroup. This yields that (x) = G. a 


Theorem 10.24 Let G be a finite group. If G has exactly one maximal subgroup, 
then the order of G is a power of a prime. 


Proof First, we show that G is cyclic. Assume that M is the unique maximal subgroup 
of G. Letx € G — M. Then, the subgroup generated by x is contained in no maximal 
subgroup and hence is equal to G. Consequently, G is a cyclic group. 

Clearly, we have |G| > 1. Assume that |G| has more than one prime factor. Then, 
it may be written |G| = p*m, where p is prime, k, m are positive integers and 
(p,m) = 1. Since G is cyclic, ifk is a divisor of |G|, then G has a subgroup of order 
k. In particular, let m = gr with q prime. So, G has a subgroup of order p*r and 
p'—'m. Since (p*r, p'~'m) = 1, neither of these is a subgroup of the other. Since 
the order of each of these is less than that of order G, it follows that each of them 
is a proper subgroup of G. Since there are no proper divisors of |G| greater than 
the order of either of these, no proper subgroup contains either of them. Therefore, 
we conclude that both of these are maximal subgroups. Since G is defined to have 
exactly one maximal subgroup, the assumption that |G| has more than one prime 
factor must be invalid. a 


Theorem 10.25 /f a finite group has exactly two maximal subgroups, then it is a 
cyclic group. 


Proof Suppose that G is a finite group. Let M; and Mz be the only distinct maximal 
subgroups of G. By Lagrange’s Theorem, if a group is finite, the order of any of its 
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proper subgroups can be no greater than half the order of the group. So, we have 
|M,| < |G|/2 and M2| < |G|/2. Therefore, we can write 


|My U Mp] < |Mi| + |Mo| — 1 < |G|/2 + |G|/2—1=|G|—1< |G]. 


Consequently, there exists x € G such that x ¢ M,; U Mp. Hence, by Lemma 10.23, 
we conclude that G is cyclic. a 


If M is a maximal subgroup of a group G, then also every conjugate aMa~! of 
M is maximal in G. Indeed, aMa~! C H < G implies that M < a~'Ha < G. For 
this reason the maximal subgroups are studied up to conjugation. 


Exercises 


1. Let M be a maximal subgroup of a group G. Prove that if M <1 G, then[G : M] 
is finite and equal to a prime number. 

2. Let G be the additive group consisting of all rational numbers. Prove that G 
contains no maximal subgroup. 

3. Let G be a group in which each proper subgroup is contained in maximal sub- 
group of finite index in G. If every two maximal subgroups on G are conjugate 
in G, prove that G is a cyclic group. 

4. If G is a perfect group and M is a maximal subgroup of G, prove that 


(1) M contains the center of G; 
(2) M/Z(G) is maximal in G/Z(G). 


10.4 Worked-Out Problems 


Problem 10.26 If > 5, prove that A,, is the unique proper non-trivial normal sub- 
group of the symmetric group S,. 

Solution Suppose that N is a proper non-trivial normal subgroup of S,, and let 
o be a non-identity permutation in N. Then, there exists i such that ia Ai. We 
choose j # i, io. Now, if rT = (i j), then a = ota !7~! is non-identity and lies 
in N. Moreover, « is a product of the transpositions oro~! and r. Consequently, 
it is either a cycle of length 3 or a permutation of the form (a b)(c d). Since N is 
normal, it contains either all cycles of length 3 or all permutations of type (a b)(c d). 
Therefore, by Theorem5.43, we deduce that N = Ay. | 


Problem 10.27 The fact that the set of all commutators in a group need not be a 
subgroup is an old result; the following example is due to P.J. Cassidy (1979). 


(a) Let F[x, y] denote the ring of all polynomials in two variables over a field F, 
and let F[x] and F[y] denote the subrings of all polynomials in x and in y, 
respectively. Define G to be the set of all matrices of the form 
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1 f(x) A(x, y) 
A=|0 1 gly) |. 
0 O 1 


where f(x) € F[x] and g(y) € Fy] and A(x, y) € F[x, y]. Prove that G is a 
multiplicative group and that G’ consists of all those matrices for which f(x) = 
0= g(y). 
(b) If (0, 0, 2) is acommutator, show that there are polynomials f(x), f’(x) € FLx] 
and g(x), g(x) € FLy] with h(x, y) = fx)ge’(y) — f'(x)aQ). 
Show that h(x, y) = x7 + xy + y does not posses a decomposition as in part 
(b), and conclude that (0, 0, 2) € G’ is not a commutator. 


(c 


wa 


solution (a) If A denoted by triple (f, g, 4), then 
AEM gs h=F+ fists hth + fe’). (10.6) 


Since G is a subset of a matrix group, we need only to check the subgroup axioms, 
i.e., it is not necessary to check the associativity. The operation in (10.6) shows 
that the multiplication of two elements of G is also in G. It is clear that 1; € G 
with f(x) = h(x, y) = g(y) = 0. Moreover, if f+ f’ =0, g+ 9’ =0 and h+ 
h’ + fg’ = 0, then we get f’ = f, g’ = —g andh’ = —h+ fg. This yields that the 
element (— f, —g, —h + fg) is the inverse of (f, g, 1). Hence, we conclude that G 
is a group. 
Ifh = h(x, y) =) ajjx‘y/, then 


(0,0,h) = | [[(aijx', 0,0), (0, »/, 0)]. 
ij 


This shows that G’ consists of all those matrices for which f(x) =0= g(y). 
(b) Let (0, 0, 4) be a commutator. Then we can write 


(0,0,4) = (f, g, hf’, ga, g. hy ' Cf. 8's hay 


=(fig Aiy(f', 8 h2)(—f, -g, —hi + fg)(—f'’, -g', —ho + f’s") 
=(ft+f', = g, m4 ho + fe (-f —f',-g—9',-hi —hot+ fet f's' + fe’) 
= (0,0, fe’ + fe’ + fet f's +(f+ fs - 8')) 
= (0,0, fg’ — f’g). 


(c) If f(x) = y. b;x' and fix%= ba c;x', then there are equations 
AO, y) = fOsg'(y) — f'O)gQ), 


6) (6) 6) 
a h(x, »)| 7 =f @)|_, g(y)— ra ha ; 
XxX x=0 = = 


2 
Fed = S60] _20)- 2025/0), 
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Hence, we obtain the following three equations 


bog'(y) — cog(y) = y’, 
big'(y) —cigiy) =y, 
bog'(y) — cag(y) = 1. 


Considering F[x, y] as a vector space over F, one obtains the contradiction that the 
independent set {1, y, y~} is in the subspace spanned by {g, g’}. 


10.5 Supplementary Exercises 


ee 


. If N is anormal subgroup of the group G, prove that (G/N)’ = G'N/N. 
. Let a and b be two elements of order m and n, respectively, in group G. Prove 


that if a and b both commute with [a, b], and d is the greatest common divisor of 
m and n, then [a, b]¢ =e. 


. Suppose that G is a group and |G| = p”, where p is a prime number. If [G: 


Cc(x)] < p, for all x € G, prove that 
(a) Cg(x) 2G, for all x € G; 

(b) G' < Z(G); 

(c) |G'| < p. 


. Let G be a subgroup of a group G. If [H, G’] = e, prove that [H’, G] = e. 
. Let p be prime and let G be a non-abelian group of order p*. Show that Z(G) = 


[G, G] and this is a subgroup of order p. 


. LetK <M <G,with K <JG. Prove that M/K is a maximal subgroup of G/K 


if and only if M is a maximal subgroup of G. 


Chapter 11 ®) 
Group Homomorphisms cies 


A homomorphism is a function between groups satisfying a few natural properties. 
A homomorphism that is both one to one and onto is an isomorphism. This chapter 
presents different isomorphism theorems which are important tools for proving fur- 
ther results. The first isomorphism theorem, that will be the second theorem to be 
proven after the factor theorem, is easier to motivate, since it will help us in comput- 
ing quotient groups. Cayley’s Theorem states that a permutation group of a group is 
isomorphic to the given group. 


11.1. Homomorphisms and Their Properties 


Here are the Cayley tables for a cyclic group of order 4 and Ujo: 


: @é aaa 1 3 7 9 
ele aaa 1 1 3 7 9 
alaa@aa@a ee 3} 3 9 1 7 
ala a e a 7/7 1 9 3 
ola ea a 9|9 7 3 1 


There is obvious sense in which these two groups are “the same’. Indeed, we can 
obtain the right table from the left table by replacing e, a, a” and a? with 1, 7, 9 and 
3, respectively. Then, although the two groups look different, they are essentially 
the same. We may think of saying two groups are the same if it is possible to obtain 
one of them from the other by substitution as above. One way to implement a sub- 
stitution is to use a function. In a sense, a function is a thing which substitutes its 
output for its input. We will define what it means for two groups to be the same by 
using certain kinds of functions between groups. These functions are called group 
homomorphisms; a special kind of homomorphism, called an isomorphism, will be 
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used to define sameness for groups. The term homomorphism comes from the Greek 
words homo, “like”, and morphe, “form”. 


Definition 11.1 Let G and H be groups. A function f : G > AH is called a homo- 
morphism if 


f(ab) = f@f), 


for all a,b € G. Here, the multiplication in ab is in G and the multiplication in 


f(a) f(b) isin H. 


This definition can be visualized as shown in Fig. 11.1. The pairs of dashed arrows 
represent the group operations. 

A short description of a homomorphism is that it preserve the operation of G. 
The set of all homomorphism from G to H is denoted by Hom(G, H). This set is 
always non-empty because it contains the zero homomorphism, the homomorphism 
which sends every elements of G to the identity element of H. 

In the definition of homomorphism, we assumed multiplicative notation for the 
operations in both G and H. If the operation in one or both is something else, we 
must adjust the definition accordingly. For instance, see Table 11.1. 

Before working out some facts about homomorphisms, we present some examples. 


Example 11.2 For any pair of groups G and H, one can always define the trivial 
homomorphism. This is the rather uninteresting function that maps every element of 
the domain to the identity element in the range. 


Fig. 11.1 Homomorphism between two groups 


Table 11.1 Operations of groups in a homomorphism 


Operation in G Operation in H Homomorphism definition 
a at: faty=fa+fo) 
a : faty) = f@)- fo) 
$ + fa-y) = fa) + fO) 
* x faxy) = f@) x fO) 
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Example 11.3 Let Z be the group of integers under addition and Z,, be the group 
of integers under addition modulo n. If we define f : Z— Z, by f(x) equals to 
remainder of x on division by n, then n is a homomorphism. 


Example 11.4 If G = (a), then f : Z — G defined by f(m) = a” is a homomor- 
phism. 


Example 11.5 Let H denote the group {1, —1} under multiplication (the 1 and —1 
here are just the ordinary numbers, H is a group of order 2). We may define a function 
f from the group Z of integers under addition to H by 


if n is even 


1 
foy= | if nis odd. 


Then f is ahomomorphism. 


Example 11.6 The logarithm function is homomorphism from the group of positive 
real numbers under multiplication, to the group of all real numbers under addition. 


Example 11.7 Let the circle group H consists of all complex numbers z such that 
|z| = 1. We can define a homomorphism f from the additive group of real numbers 
Rto H by f(@) =cos6é +i sin@. Geometrically, we are simply wrapping the real 
line around the circle in a group-theoretic fashion. 


Example 11.8 Let R[x] denote the group of all polynomials with real coefficients 
under addition. For any p € R[x], let p’ denote the derivative of p. Then, the function 
p+ p’ isahomomorphism from R[x] to itself. 


Example 11.9 Let G = C((0, 1]) be the additive group of all continuous real-valued 
functions. Then, the integration function from G to R given by f th ie Ff (x)dx is 
a homomorphism. 


Theorem 11.10 Jf G and H are groups and f : G — H is a homomorphism, then 


(1) fe) =e, where e on the left is the identity in G and e on the right is the identity 
in H; 
(2) f(a!) = f(@"|, foralla € G. 


Proof (1) Since ee = e, it follows that f(e) f(e) = fle) = flee) = fle) = flee. 
So, by cancellation property in H, we obtain f(e) = e. 

(2) Leta € G bean arbitrary element. Since f (aa~!) = f(e) =e, it follows that 
f@f (a~'!) =e, and so by the definition of inverse, we conclude that f (a7!) = 


fay. = 


Informally, we can speak about a homomorphism as a function that respects 
structure. A homomorphism of groups “respects” the property of the identity element, 
multiplication, and inversion. 


Theorem 11.11 Jf G and H are groups and f : G — H is a homomorphism, then 
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(1) For any integern anda € G, f(a") = f(a)"; 
(2) For anya € G, if the order of a is finite, then o(f (a)) |o(a). 

Proof (1) It follows from the first part of Theorem 11.10 trivially when n = 0, and 
by mathematical induction for n > 0. If n < 0, then put m = —n. Hence, we obtain 
(2 =fE Af) =f@ "=f @ =F a". 

(2) If o(a) = n, then a” = e, and so we obtain f(a”) = f(e) = e. This implies 
that f(a)” = e. Thus, we conclude that o( f (a)|n. | 


Remark 11.12 If the operation of H is addition, then the property (1) in Theo- 
rem11.11 becomes f(a”) = nf (a). If both the operation of G and H are addition, 
then the property (1) becomes f (na) = nf (a). 


Example 11.13 We want to determine all homomorphisms from Zz, to Z1g. Suppose 
that f : Z24 — Z)g be ahomomorphism. For every non-negative integer m, we have 


f@) = fC41t+-- +D=fO+ (M+: + fO = ms, 
m times m times 
for all 0 < m < 23. This means that a homomorphism f is completely determined 


by the value f(1). Let f(1) = 7, where 0 < n < 17. In additive group Z)g, we can 
write 


24n = 24n = 24f (1) = f24 = fO. 


Since any homomorphism maps the identity to identity, it follows that f(0) = 0. 
Hence, 24n = 0. This yields that 18|24n or 3|4n. Since (3, 4) = 1, it follows that 3|n, 
and so we conclude that n € {0, 3, 6, 9, 12, 15}. Therefore, any homomorphism 
must send I to 3k, fork = 0, 1,..., 5. This shows that there exist at most six possible 
homomorphisms from Z4 to Z;g, namely fo, fi,..., fs, where f,(m) = 3km, for 
every k =0,1,...,5. 

Now, we investigate that for each 0 < k <5, f;, is well defined. Assume that 
m,m' € Z such that m = m’ in Zo4. Then, we have 24|m — m’, and so 3km — 3km’ 
is a multiple of 72. It follows that 3km — 3km’ is a multiple of 18. Thus, 3km = 3km’ 
or f(T) = film’). 


In the rest, we show that f; is a homomorphism. If m, m’ € Z, then we have 


fi +m’) = fx(m +m’) = 3k(m +m’) 
= 3km + 3km’ = f(m) + f(m’). 


Therefore, each f; is ahomomorphism from Z4 to Zig. 


Since homomorphisms preserve the group operation, it should not be a surprise 
that they preserve many group properties. 


Theorem 11.14 Let G and H be groups and f : G — H be a homomorphism. 
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(1) The image of a subgroup of G is a subgroup of H. In particular, f (G) (or another 
notation, Imf ) is a subgroup of H; 
(2) The inverse image of a subgroup of H is a subgroup of G. 


Proof (1) Let A be a subgroup of G. Since e = f(e) € f(A), it follows that f(A) 
is non-empty. Let f(a) and f(b) be two arbitrary elements of f(A). Then, we have 
f(@ f(b)! = f(ab~'). Since A is a subgroup, it follows that ab~! € A. Therefore, 
we conclude that f(a) f(b)~' € f(A), and so f(A) is a subgroup of H. 

(2) Let B be a subgroup of H. Again, since f(e) =e € B, it follows that 
e € f—'(B). Hence, f~'(B) is non-empty. Assume that x and y are two arbitrary 
elements of f~'(B). Then, we have f(x) € B and f(y) € B. Since B is a subgroup, 
it follows that f(x) f(y)~! € B, and so f(xy~!) € B. Therefore, we conclude that 
xy! € f~'(B). This completes the proof. | 


Theorem 11.15 Let G and H be groups and f : G > H be a homomorphism. If 
A<l Gand B SH, then f(A) < f(G) and f-'(B) SG. 


Proof By Theorem 11.14, we know that f(A) < f(G) and f '(B) <G. 

Now, let f(a) € f(A) and f(x) € f(G) be arbitrary. Then, we can write 
f(x) fla) f(x)! = f(xax7!). Since A <1G, it follows that xax~! € A. So, we 
conclude that f(x) f(a) f(x)7! € f(A). This shows that f(A) <I f(G). 

Finally, suppose that x € G and y € f~!(B) are arbitrary. Then, we have f(x) € 
H and f(y) € B. Since B <1 H, it follows that f(x) f(y) f(x)7! € B, or equiva- 
lently f(xyx~'!) € B. This implies that xyx~! €¢ f~'(B), andso f-'(B) IG. Wf 


The following is an important concept for homomorphisms. 


Definition 11.16 The kernel of ahomomorphism f from a group G to a group H 
is the set {x € G| f(x) = e}. The kernel of f is denoted by Kerf. 


Example 11.17 Let D(R) be the additive group of all differentiable functions, 
f:R—R, with continuous derivative. Let C(R) be the additive group of all 
continuous functions f : R > R. Suppose that F : DCR) > C(R) be defined by 
F(f) =df/dx. Then 


(1) F is a homomorphism; 
(2) KerF ={f € D(R) | df/dx = 0}, which is the set of all constant functions. 


The following is a fundamental theorem about the kernel of a homomorphism. 


Theorem 11.18 Jf G and H are groups and f : G — H is a homomorphism, then 
Kerf is anormal subgroup of G. 


Proof Since f (e) = e, it follows that Kerf is non-empty. Now, we suppose that a 
and b are two arbitrary elements of Kerf. Then, we have f(a) = e and f(b) =e. 
Consequently, we get f(ab~!) = f(a) f(b)! = ee"! = e. This shows that ab~! € 
Kerf and so Kerf is a subgroup of G. On the other hand, for any x € G, we 
have f(xax~!) = fx) f(@f)7! = f@ef (x)! = e. This yields that xax~! € 
Kerf. Therefore, we deduce that Kerf is also a normal subgroup of G. a 
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Kerf = f~*({e}) 


bKerf = f-'({a'}) 


Fig. 11.2 Illustration for the kernel 


Theorem 11.19 Jf G and H are groups and f : G — H is a homomorphism, then 


(1) f(a) = f(b) ifand only ifaKerf = bKerf; 
(2) If f(b) =a’, then f-'({a’}) = {x € G| f(x) = a@’} = bKerf. Lookat the illus- 
tration in Fig. 11.2. 


Proof (1) We have 


f(a) = f(b) & fb) 'f@ =e fb'a)=e 
© b'ae Kerf saKerf =bKerf. 


(2) We must show that f~!({a'}) C bKerf and bKerf C f—'({a’}). To demon- 
strate the first inclusion, assume that x € f~'({a’}) is an arbitrary element. Then, 
we have f(x) =a’, and so f(x) = f(b). Now by part (1), we conclude that 
xKerf = bKerf,and hence x € bKerf. 

To prove the second inclusion, suppose that x € Kerf is an arbitrary element. 
Then, we have f(bx) = f(b) f(x) = a’e =a’ or bx € f—'({a'}). This completes 
the proof. | 


Theorem 11.20 Let G and H be groups and f :G — H be a homomorphism. 
Then, f is one to one if and only if Kerf = {e}. 


Proof Suppose that f is one to one. If x € Kerf, then f (x) = e. On the other hand, 
we always have f(e) = e. Now, since f(x) = f(e) and f is one to one, it follows 
that x = e. This proves that Kerf = {e}. 

Conversely, let Kerf = {e} and assume that a and b are elements of G such that 
f(a = f(b). Then, fla f(b)! = e, or equivalently f(ab~'!) = e. This means that 
ab—' € Kerf = {e}, and soa = b. Hence, f is a one to one function. | 
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Exercises 


1. Let X be a non-empty set of generators of G. Let f: G > H andg:G— H 
be homomorphisms. If for every x € X, f(x) = g(x), prove that f = g. 
2. Let G and H be groups and f : G — H be a homomorphism. Show that 
fix, yD) =[f@), fQ)) forall x, y € G. 
3. Describe all the homomorphisms from Z) to itself. 
4. Let f : Zj — Zz be a homomorphism such that f(1) = 4. Compute f (5). 
What is the kernel of f? 
5. Show that there is a homomorphism from S, to Z2 whose kernel is Ay. 
6. Determine all homomorphisms from Z onto $3. Determine all homomorphisms 
from Z to $3. 
7. Find all group homomorphisms from Z, into Zo. 
8. How many homomorphisms are there from Z29 onto Zg? 
9. If Gisa finitely generated group by a set with n elements and H is a finite group, 
prove that |Hom(G, H)| < |A|”. 
10. If f isa homomorphism from Z39 onto a group of order 5, determine the kernel 
of f. 
11. Let G and H be groups and f : G — H be an onto homomorphism. If X is a 
subset of G that generates G, show that f(X) generates H. 
12. Find a homomorphism f from U39 to U39 with kernel (1, 11} and f (7) =7. 
13. Let Z[x] be the group of polynomials in x with integer coefficients under addi- 
tion. Prove that the function from Z[x] into Z given by f(x) & f (3) isahomo- 
morphism. Give a geometrical description of the kernel of this homomorphism. 
14. Prove that the function from R under addition to GL2(R) that takes x to 


cosx sinx 
—sinx cosx 


is ahomomorphism. What is the kernel of this homomorphism? 
15. Let G be a subgroup of some dihedral group. For each x € G, we define 


$o= 1 if x is a rotation 
—1 if x isa reflection. 
Prove that f is a homomorphism from G to the multiplicative group {1, —1}. 
What is the kernel? 

16. (a) Letn > 5 andm > 3. Suppose that m is not divisible by 3. Let f : S, > Dn 
be a homomorphism. Prove that A, is contained in the kernel of {. How 
many homomorphisms S, — D,, are there? 

(b) Let n > 5 and m > 3 be arbitrary, so m may be divisible by 3. How many 
homomorphisms S, — D,, are there? 
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11.2 Isomorphism Theorems 


We wish to have a way to determine if two groups have similar properties. The 
advantage of this is that if we could tell that two groups G and H have similar 
properties and we already know all the properties of G, then we would immediately 
know all the properties of H. The tool which will allow us to do this is called an 
isomorphism, from the Greek words “isos” which mean “same” and “morphe” which 
means “form”. 


Definition 11.21 Let G and H be groups. A function f : G > H is said to be an 
isomorphism if the following conditions are satisfied: 


(1) f is ahomomorphism; 
(2) f is one to one; 
(3) f is onto. 


If there is an isomorphism from G onto H, we say that G and H are isomorphic and 
write G = H. 


Example 11.22 If G is a group, then the identity function id : G > G defined by 
id(x) = x, for all x € G, is an isomorphism. 


Example 11.23 The group of real numbers R under addition and positive real num- 
bers R* under multiplication are isomorphic. Indeed, if we consider f : R > Rt 
by f(x) = e*, then for every real numbers a and b we have f(a +b) = e@*? = 
e“e? = f(a) f(b). So, f is a homomorphism. Moreover, by the well-known results 
of calculus, f is one to one and onto. 


An injective (or one to one) homomorphism is called a monomorphism and a 
surjective (or onto) homomorphism is called an epimorphism. Of course a bijective 
homomorphism is what we have been calling an isomorphism. A group H is said to 
be a homomorphic image of a group G, if there exists an epimorphism from G onto 
H. 

Let G and H be groups and f : G > H bea homomorphism. Clearly, we have 


(1) f is a monomorphism if and only if Kerf = {e}; 
(2) f is an epimorphism if and only if mf = H; 
(3) f is an isomorphism if and only if Kerf = {e} and Imf = H. 


How do we demonstrate that two groups G and H are not isomorphic, if this is the 
case? A structural property of a group is one that must be shared by any isomorphic 
group. It is not concerned with names or some other non-structural characteristics 
of the elements. In order to prove that two groups G and H are not isomorphic, 
one needs to demonstrate that there is no isomorphism from G onto H. Usually, in 
practice, this is much easier than it sounds in general, and is accomplished by finding 
some structural property that holds in one group, but not in the other. 


Example 11.24 Z, 4 S3 because Z¢ is abelian and S3 is not abelian. 
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Example 11.25 The dihedral group Dj» is not isomorphic to S4 because D)2 has 13 
elements of order 2 (12 reflections and the rotation for 7), while S4 has only 9 such 
elements (transpositions and product of disjoint transpositions). 


Lemma 11.26 Let G and H be groups and f : G — H be an isomorphism. Then, 
f—! :H = Gis also an isomorphism. 


Proof Since f is a bijective function, by Theorem5S.5, it follows that f—! exists 
and it is a bijective function from H onto G. So, it remains to be seen that f—! is 
a homomorphism. Assume that x and y are arbitrary elements of H. Then, there 
exist a, b € G such that f(a) = x and f(b) = y. This yields that a = f—'(x) and 
b = f~'(y). Hence, we can write xy = f(a) f(b) = f (ab), and so f~!(xy) = ab. 
This shows that fly) 4 fo @)Ff (y). Therefore, f° is ahomomorphism. 


Lemma 11.27 If f : G— H and g: H — K are isomorphisms between groups, 
then so is the function g o f. 


Proof By Theorem5.6, gof is a bijective function from G onto K. To check that 
go f isahomomorphism, let a and b be arbitrary elements of G. Then, we obtain 


(g 0 f)(ab) = g(f(ab)) = g(f(a)f)) 
= (f(a))g(f)) = (go P@(go fb), 


where we used in turn the facts that f and g are homomorphisms. Therefore, go f 
is an isomorphism. | 


Theorem 11.28 The relation of isomorphism between groups is an equivalence rela- 
tion on the family of all groups. 


Proof The result follows by Lemmas 11.26 and 11.27. | 


Lemma 11.29 Let G and H be two cyclic groups of the same order. Then, G and 
Hare isomorphic. 


Proof Suppose that G = (a) and H = (b). If x € G, then x = a’, for some integer 
i. Define f : G > H by f(x) =D’. 

We first have to check that f is well defined. If G is infinite, then so is H and every 
element of G may be uniquely represented in the form a‘. Thus, f is automatically 
well defined in this case. Now, assume that G has order n, and suppose that x = a/, 
too. We have to check that b! = b/. 

Since a’ = a/, it follows that a'~/ = e, andsom|i — j. Since |H| = m, it follows 
that i — j = k|A|, for some integer k. Consequently, we can write b'~/ = pKl#! — 
(b\# I) = e. This shows that b' = b/. Therefore, we conclude that f is well defined. 
Now, let a’ and a/ be two elements of G. Then, we have 


fia) = fa’) =b = bib! = fa@)fa). 
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Fig. 11.3 Vertices of the 
cube with the same number 
are endpoints of the four 
diagonals of the cube 


This means that f is a homomorphism. Moreover, the function g : H — G defined 
by g(b') =a’ is the inverse of f, and so f is a one-to-one correspondence. This 
completes the proof. | 


Corollary 11.30 Any infinite cyclic group is isomorphic to the additive group Z. 
Any finite cyclic group of order n is isomorphic to Z,, the additive group of modulo 
n. 


Proof The result follows directly from Lemma 1 1.29. | 


Example 11.31 (The Rotation Group of a Cube) A cube has 8 vertices, 12 edges, 
and 6 faces (see Fig. 11.3), each of which is a square. The cube has exactly 24 
rotational symmetries, which comprise: 


(1) 1 trivial rotation or the identity symmetry; 

(2) 9 non-trivial rotations (by 2/2, 2 and 37/2) about 3 axes joining the centers of 
opposite faces; 

(3) 8 non-trivial rotations (by 27/3 and 47r/3) about the 4 great diagonals; 

(4) 4 non-trivial rotations (by zr) about the 4 axes joining the midpoints of opposite 
edges. 


Since the group of rotations of a cube has the same order as S4, we need only prove 
that the group of rotations is isomorphic to a subgroup of S4. We number the vertices 
of the cube from | to 4, and where opposite vertices are given the same number. Now 
a cube has 4 diagonals and any rotation induces a permutation of these diagonals. 
But we cannot just assume that different rotations correspond to different rotations. 
Observe that vertices with the same number are endpoints of the four diagonals of the 
cube. We can number the diagonals according to their endpoints. The permutations 
of the numbers of vertices in the front face of the cube correspond to permutations of 
the diagonals. Therefore, there is a one to one correspondence between the rotations 
of the cube and the permutations of the diagonals of the cube. The composition of 
rotations coincides with the product of permutations and so the group of rotations of a 
cube is isomorphic to the symmetric group S4. We see two perpendicular axes where 
x /2 rotations give the permutations 0 = (1234) andt = (1432). These induce the 
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subgroup {id, o, o*, 0, t7, t20, t70”, t’o7} and the subgroup {id, oT, (ot)°}. 
Consequently, the rotations induce all 24 permutations since 24 = (8, 3). 


There are three theorems, formulated by E. Noether, describing the relationship 
between factor groups, normal subgroups and homomorphisms. These theorems are 
based on a basic result on homomorphisms presented in the following lemma. 


Lemma 11.32 [f N is a normal subgroup of a group G, then the canonical map 
ma :G-— G/N, given by x(a) = aN, is an onto homomorphism with kernel N. 


Proof Clearly, we have m(ab) = abN =aNbN =ax(a)r(b), for all a,be G. 
Moreover, since every element of G/N is of the form aN, for some a € G, it fol- 
lows that z is onto. Finally, for each a € G, we have a € Kermz if and only if 
(a) =aN = N ifandonlyifae N. a 


Theorem 11.33 Let G and H be groups and f : G > H bea homomorphism and 
suppose that N is anormal subgroup of G satisfies N < Kerf. Then, there exists a 
unique homomorphism g : G/N — H which satisfies gon = f. 


Proof Let aN bea left coset of N in G. Choose any x € aN, then x = ay, for some 
y € N. Moreover, we have f(x) = f(ay) = f@af(v) = f(@Me = f(a), because 
N < Kerf. Therefore, f has the same effect on every element of the coset aN. So, 
if we define g: G/N — H by g(aN) = f(a), foralla € G, then g is well defined. 
Now, assume that aN and DN are arbitrary elements of G/N. Then, we have 


g(aNbN) = g(abN) = f(ab) = fa fb) = g(aN)g(bN). 


Hence, g is ahomomorphism such that g o zm = f. At the end, we prove the unique- 
ness. Suppose that g, g’: G/N — H are homomorphisms which satisfy g oz = f 
and g’oz = f. Then, we have g(z(a)) = g' (x(a), for all a € G. Since z is onto, 
any element of G/N has the form z(a), for some a € G. Thus, we conclude that 
g = g’. This completes the proof. | 


Theorem 11.34 (First somorphism Theorem) Let G and H be groups and f : G > 
H beahomomorphism. Then, there exists anisomorphism g : G/Kerf — Imf such 
that goa = f. In particular, if f is onto, then G/Kerf = H. In this case, we say 
the diagram presented in Fig. 11.4 is commutative. 


Proof Since Imf is a subgroup of H, without loss of generality we may assume 
that H = Imf. With N = Kerf we conclude that by Theorem 11.33 that there is 
an onto homomorphism g : G/N — Imf given by g(aN) = f(a), for alla € G. 
Since Kerg = {aN | f(a) =e} = {aN |a € N} = {N}, it follows that g is one to 
one. This completes our proof. | 


Theorem 11.34 also is called the fundamental theorem of homomorphism, and it 
is one of the most basic theorems in group theory. 
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Fig. 11.4 Commutative diagram for the first isomorphism theorem 


Fig. 11.5 The second 


AK 
isomorphism theorem / \ 
HT kK 
BOE 


Example 11.35 Let f : Z — Z, be the homomorphism defined in Example 11.3. 
We obtain Imf = Z, and Kerf = nZ, and so we conclude that Z/nZ = Z,,. 


Example 11.36 The function f : R — C* given by f(x) = e?"*, for all x € R, 
is a homomorphism. We obtain Imf = {z € C | |z| = 1} := S!, the unit complex 
numbers and Kerf = Z. Therefore, we can write R/Z = S!. 


Theorem 11.37 (Second Isomorphism Theorem) Let G be a group and H, N be 
subgroups of G. If NG, then HN N is normal in H and H/(HQN) = HN/N. 


The second isomorphism theorem can be represented pictorially as in Fig. 11.5. 


Proof Since N is anormal subgroup of G, it follows that NH = HN, and so HN 
is a subgroup of G and also we have N < HN. Moreover, for every x € HN, there 
existh € H andn € N suchthatxNx~! = hnNn~'h7! = hNh7! = N. This yields 
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that NV < HN. Now, we define a function f : H > HN/N by f(h) =AN, for all 
h € H. Clearly, f is a homomorphism. Consider xN €¢ HN/N, where x € HN. 
Then, x = hn, for some h € H and n € N. Hence, we have xN = hnN =hAN = 
f(A). This shows that f is onto. Now, by the first isomorphism theorem, we obtain 
H/Kerf = HN/N. If we can establish that Kerf = HN, we shall obtain that 
HN is anormal subgroup of H and H/(H NN) = HN/N. Indeed, we have 


Kerf ={(he H| fh) =Nj={he H|hN=N} 
={heH|heNJ=ANN. 


This completes the proof. | 


Theorem 11.38 (Third Isomorphism Theorem) Let H and N be normal subgroups 
of a group G such that N < H. Then, H/N is a normal subgroup of G/N and 
(G/N) /(H/N) = G/H. 


Proof We define f : G/N — G/H by f(aN) =aH, for every a € G. Since f 
is defined on cosets, we should check that f is well defined. To begin with, if 
aN = DN, thena~'b e€ N. Since N < H, it follows that a~'b € H or aH = bH. 
This shows that f is well defined. For every aN and bDN in G/N, we have 
F(aNbN) = f(abN) = abH = aHbH = f (aN) f (bN). Hence, f is ahomomor- 
phism. Clearly, f is onto, for if aH € G/H, then f(aN) = aH. Furthermore, we 
have 


Kerf = {aN € G/N | f(aN) = H} = {aN € G/N | aH = H} 
={aN ¢€G/N|ae H}=H/N, 


as required. The result now follows by the first isomorphism theorem. | 


We may picture the third isomorphism theorem as illustration in Fig. 11.6. 


Exercises 


1. Find a group which is isomorphic to one of its proper subgroups. 

2. Let G = R \ {—1} and define a binary operation on G byaxb=a+b-+ab. 
Prove that G is a group under this operation. Show that (G, *) is isomorphic to 
the multiplicative group of non-zero real numbers. 

3. Prove that 


(a) The multiplication groups R* and C* are not isomorphic; 
(b) The additive groups R and Q are not isomorphic; 
(c) The additive groups Z and Q are not isomorphic; 


4. Prove that every cyclic group of finite order n is isomorphic to the multiplicative 
group of all complex nth roots of 1. 
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Fig. 


12. 


13. 
14. 


15. 


Q 
x 
Q 


JIN H/N/ (G/N) /(H/N) © G/H 


11.6 Illustration for the third isomorphism theorem 


. Let 1 <n <3. Prove that any two groups containing exactly n elements are 


isomorphic. 


. If Gis anon-abelian group of order 6, prove that G = 83. 
. Let X; and X, be two sets. Suppose that there exists a one to one correspondence 


between X, and X2. Show that there exists an isomorphism of Sy, onto Sx,. 


. Prove that any group of order 4 is isomorphic to the group Z, or to the group 


K ={id, (12)3.4), (1 3)(24), (14)(2 3}. 


. IsQ/Z = Q? 
. Show that the groups Q/Z and R/Q can not be isomorphic. 
. Let f, g: R — R be real valued functions defined by f(x) = 1/x and g(x) = 


(x — 1)/x. Then, f and g generate a group G with the operation given by 
function composition. Prove that G = $3. 
Let X be a non-empty set and G = {f | f : X — Zp}. Show that 


(a) G is a group addition of functions; 
(b) P(X) is a group under the binary operation 


AA B=(AUB)—(ANB); 


(c) (P(X), A) = (G, +). 


Show that the nth roots of unity are isomorphic to Z,,. 

Show that to each positive integer n there exist only a finite number of pairwise 
non-isomorphic groups of order 7. 

Let G be a non-abelian group. Prove that the group of automorphisms of G is 
not cyclic. 
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26. 


2 


28. 


29. 


30. 


. Prove that Ds and Qs are not isomorphic. 

. Show that Uj7 is isomorphic to Z16. 

. Show that the three groups Z¢, Uy and Ujg are isomorphic to each other. 

. How many pairwise non-isomorphic groups can you find which are homomor- 


phic images of $3? 


. Suppose that for each prime p, Z, is the homomorphic image of a group G. 


What can we say about |G|? 


. Prove that GL2(Z2) = $3. 
. Show that D4 is isomorphic to the subgroup of all lower triangular matrices in 


GL3(Zo). 


. Show that SL2(Zs) is not isomorphic to Sy. 
. Explain why S,, (n => 3) contains a subgroup isomorphic to D,. 
. For real numbers a and b with a # 0, define f,, : R— Rby fa,(x) = ax +b, 


forallx € R.LetG = {fap | a,b € Randa 4 0} and N = { fi, |b € R}. Prove 
that NV is anormal subgroup of G and G/N is isomorphic to the group of non-zero 
real numbers under multiplication. 

Let G = {z € C| z” = 1, for some positive integer n}. Prove that for any fixed 
integer k > 1, the function f from G to itself defined by f(z) = z*, for all 
z € G, is an onto homomorphism but is not an isomorphism. 

Let a group G contain two normal subgroups K and N. Let H be a subgroup of 
G. Prove that HK/K = HN/NifHOK=4HQON. 

In the group GL2(Zs), let 


0 -l -1 1 

Balle ol aa) 
Prove that H = Qg and H < GL2(Z3). 
Let G be a group, and suppose that S be any set for which there exists 
a bijective function f : G — S. Define a binary operation on S by setting 
a-b= f(f~'(@f~'(b)), for all a,b € S. Prove that S is a group under this 
binary operation, and that f is actually a group isomorphism. 
Let G be defined as all formal symbols x‘ y/,i = 0, 1 and j = 0,1,....n—1, 
where we assume 


vy=axly @ isi, j=’, 

x =y"=e, n > 2, 

xy= yo! x. 

(a) Find the form of the product (x! y/)(x* y!) as x” y°; 

(b) Prove that G is non-abelian of order 2n; 

(c) If 7 is odd, prove that the center of G is {e}, while if 1 is even the center of 
G is larger than {e}; 

(d) Can you interpret this group as the dihedral group of order 2n. 
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11.3. Cayley’s Theorem 


Our next subject is a classic theorem of Cayley. The proof of this theorem is not 
difficult, and it is a good exercise in group theory as it uses many concepts previously 
studied. Then, an important generalization of it is given. 


Theorem 11.39 (Cayley’s Theorem) Let G be a given group. Then, there exists a 
set X such that G is isomorphic to a permutation group on X. 


Proof We choose the set X consisting of all the elements of G. For each element 
a € G, let pq be a function on X defined by the formula 


Pa(X) = Xa, 


for all x € X. It follows that p, is a permutation on X. Furthermore, the associative 
law proves 


Pab = Pa © Pb, 


for alla, b € G. Thus, the function p is ahomomorphism from G into the symmetric 
group Sx. Clearly, p, is the identity function on X if and only if a = e. This means 
that p is one to one. Hence, G is isomorphic to the image p(G) which is a permutation 
group on X. | 


If a group G is isomorphic to a subgroup of a group H, we say G embeds in H. 
In this case there is an embedding (another word for one to one homomorphism) 
G ~ H which identifies G with its image in H. So, Cayley’s Theorem says that 
every finite group embeds in a symmetric group. 


Theorem 11.40 (Generalized Cayley’s Theorem) Let H be a subgroup of a group 
G, and X be the set of all left cosets of H in G. Then, there exists a homomorphism 
from G into the permutation group Sy whose kernel is the largest normal subgroup 
of G that is contained in H. 


Proof Suppose that H < Gand X = {xH |x € G}. Note that X need not be a group 
itself. If a is a fixed element of G, we define 6, : X — X by 6,(xH) = axH, for all 
x € G. Simulating the proof of Theorem 11.39, we find that 0, € Sy and Og, = 0,05. 
This shows that the function f : G > Sx defined by f(a) = 0, is ahomomorphism 
of G into Sy. Next, we identify the kernel of f. We have 


Kerf = {aeG| f(a) = 6} 
= {a€G|O(*xA) = 0.(xA), for all x € G} 
= {ae G|axH = xH, forall x € G} 
={aeG|x'axH = H, forall x € G} 
={aeG|x~'ax € H, forall x € G} 
={aeG|aexHx"!, forallx eG} 
= (jane "27. 


xeEG 
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Now, we claim that from this characterization of Kerf, Kerf must be the largest 
normal subgroup of G which is contained in H. To prove this, suppose that N 
is a normal subgroup of G contained in H. Using N < H, it is immediate that 
N =xNx7! C xHx7!, for all x € G. Thus, we conclude that 


N= [| she = Kerpe 
xeG 


This completes the proof. | 


Note that Theorem 11.40 reduces to Cayley’s Theorem in the special case of 
H = {e}. 

In Theorem 3.56, we proved that a Cayley table is a Latin square, i.e., the rows and 
columns are permutation of one another. However, the associative law is not easy to 
discern by the naked eye. In Theorem 3.57, we studied a method for the verification of 
the associative law. Now, by using a method similar the proof of Cayley’s Theorem, 
we give another procedure for verification of the associative law. 


Theorem 11.41 A Latin square is a Cayley table if composite of two rows is some 
row in the table. 


Proof Weuse 1, 2, ..., n to denote the entries in a Latin square and a;; denote the 
entry at the ith row and the jth column. In order to prove the statement, it remains to 
check if associative law holds in the set G = {1, 2, ..., nm} with the binary operation 
x defined by the given Latin square. We assume | is the identity element. 
Simulating the proof of Theorem 11.39, we defineo : G > S, byo(j) = 0; such 
thatio; = a;; =i» j, foralli, j ¢ G. The right cancellation law guarantees that o; 
is indeed in S,,. Although G is a set for all we know, Sg is a group. Associative law 
in G amounts to the relation ojo; = 0;,;, for all i, j € G. If that is true, then o(G) 
is a subgroup of Sg. Conversely, if o(G) is a subgroup of Sc, then the associative 
law holds, for we claim that o;0;, which is o;,, for some k € G, is indeed 0;,;. To 
see this, we show that k must be i x j. Toward this end, consider lojo; and lox. 
The former is equal to (1 *i)o; = io; =i j, while the later is equal to l*«k =k. 
Hence, it reduces to check if o(G) is a subgroup of Sg. Since Sg is a finite group, 
we need only to check that if the multiplication of any two permutations in o (G) is 
still a permutation in o(G). This can be achieved by checking the row of the Latin 
square. | 


Example 11.42 As a quick test, examine the following two Latin squares of order 
4. One of these turns out to be a Latin square that we can get from a group, while the 
other does not. Which is which? 


SIAR; ao] eo 
Qlelal ea 


Qa}; apa} 
O;apaep ays 
SSR] Qypayo 
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It is easy to check that the first square can be the Cayley table for the group Za, 
if we consider x = 0,a = 1,b = 2 andc =3. 

The second table seems so different. Assume that it is a Cayley table of a group. 
Then, for some ordering [o1, 02, 03, 04, 05] of our group elements and another 
ordering [tT], T2, 73, T4, Ts], we have 


T] T %T %T TS 
o}| x a b cd 
o.| a dc x b 
o3| b x a die 
o4| c b dea «x 
o5s|d c x bia 


One of the 01, 02, 03, 04, Os is the identity element. One of the five rows of the 
table must be [t,, T2, 73, Ta, Ts]. In particular, we can actually assume that our 
top row is formed by taking o, ' and acting to each element of [x, a, b, c, d], 
because the row [x, a, b, c, d] is just multiplying [t;, t2, 73, Ta, T5] to oj, 1.e., 
[710], 7201, 7301, T401, T5Ta] =[x, a, b, c, d], or equivalently, [t,, t2, 3, Ta, 
ts] = [xo, z ao,', bo, ', co; 4 do, ']. We can think of these as permutations of 
the row [x, a, b, c, d]. For example, the row corresponding to 02, can be considered 


as the permutation 


Similarly, the row corresponding to 03, gives us the following permutation: 


a5 eS) 
3=\pxadc)’ 


If we consider the product of 02 and 03, then we obtain 


Se Ce Se) 
203 \adcexb bxade xcdba/y 


Exercises 


ra 


How many ways are there to embed Z, in S4? 
2. How many ways are there to embed Dy in $4? 
3. Let G be a group and H be a subgroup of G. Let X = {Ha|a € G} be the set of 
all right cosets of H in G. Define, forb € G, T, : X > X by T,(Ha) = Hab"!. 
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(a) Prove that the function f : G — Sx defined by f(b) = Tp, for all b € G, is 
a homomorphism; 

(b) Describe Kerf; 

(c) Show that Kerf is the largest normal subgroup of G lying in H. 


4. Show that the function f : S, — Anj+2 given by 


we oO if o is even 
EON tle Ol a deodd, 


is an isomorphism between S, and a subgroup of A,,,2. Deduce that every finite 
group is isomorphic to a subgroup of A,+2. 

5. Show that, forn < 4, any Latin square of order n can be obtained from the Cayley 
table of a group by permuting rows, columns, and symbols; but this is not true for 
n=5. 


11.4 Automorphisms 


A homomorphism of a group G into itself is called an endomorphism. The identity 
function is clearly an endomorphism. The set of all endomorphism from G to itself 
is denoted by End(G). 


Definition 11.43 By an automorphism of a group G, we shall mean an isomorphism 
of G onto itself. 


Lemma 11.44 Let G bea group anda € G isa fixed element of G. If a function oa : 
G = G is defined by ba(x) = axa“, for every x € G, then ¢ is an automorphism 
of G. 


Proof Assume that x and y are arbitrary elements of G. Then, we have ¢a(x)¢a(y) = 
(axa~!)(aya~! = a(xy)a7! = $a (xy). So, bg is ahomomorphism. Now, we inves- 
tigate that ¢, is a bijective function. In fact, if 6,(x) = da(y), then axa! = aya', 
and so x = y. This shows that ¢, is one to one. On the other hand, for each y € G, 


we can write ¢,(a~'!ya) = aa~'yaa~! = y. This means that @, is onto. | 


The function ¢, is called the inner automorphism by a. 
Let Aut(G) denote the set of all automorphisms of G. For the product of elements 
of Aut(G), we can use the composition of functions. 


Theorem 11.45 Jf G is a group, then Aut(G) is also a group. 


Proof Note that the identity function belongs to Aut(G). Since the resultant com- 
position for functions is in general associative, it follows that the composition in 
Aut (G) is also associative. Now, the result follows immediately from Lemmas 11.26 
and 11.27. a 
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Aut(G) is called the group of automorphism of G. 


Theorem 11.46 The set ofall inner automorphisms is anormal subgroup of Aut(G), 
which is written Inn(G). 


Proof Clearly, the identity function lies in Inn(G) as id(x) = x = exe! = @,(x), 
for all x € G. Now, let a, b and x be arbitrary elements of G. We can write ¢, 0 
p(x) = bg(bxb7!) = abxb-!a7! = (ab)x(ab)—! = dap(x). This means that 


Pa 0 Pb = hab- (11.1) 


Thus, ¢, 0 db, € Inn(G). Moreover, using (11.1), we can write ¢, 0 dg-1 = dg-1 © 
ga = be. This implies that (¢,)~' = ¢@g-1 € Inn(G). Hence, we conclude that 
Inn(G) is a subgroup of Aut(G). To prove that [nn(G) is a normal subgroup of 
Aut(G), let dg € Inn(G) and f € Aut(G) be arbitrary elements. Then, for any 
x € G, we have 


(foba0 fa) = fo da(f'(@)) = faf-'()a"') 
= FASS O)LO') = f@xf@" = bya). 


This shows that f ogg 0 fat € Inn(G), and so Inn(G) <1 Aut(G). a 


Definition 11.47 Inn(G) is called the group of inner automorphisms of G. If G 
is abelian, then Jnn(G) = {e}. An automorphism of G which is not inner is called 
outer. The factor group Out(G) = Aut(G)/Inn(G) is called the group of outer 
automorphisms of G even although its elements are not automorphism. 


Theorem 11.48 The group of inner automorphisms of G is isomorphic to the quo- 
tient group G/Z(G), where Z(G) is the center of G. 


Proof We define f : G > Inn(G) by f (a) = da, for alla € G. Then, we can write 
Ff (ab) = bar = ba 0 by = f (a) f (b), for alla, b € G. This means that f is ahomo- 
morphism. Since each element of Jnn(G) is of the form @¢,, it follows that f is onto. 
Hence, by applying the first isomorphism theorem, we obtain G/ Kerf = Inn(G). 
Now, the result follows if we show that Kerf = Z(G). An element a of G lies in 
the kernel K if and only if 6, = id, Le., a~'xa = x, for all x € G. This yields that 
Kerf = Z(G). | 


Theorem 11.49 Let G = (a) be a cyclic group, and suppose that f is an endo- 
morphism of G. Then, f is an automorphism if and only if f(a) is a generator of 
G. 


Proof Suppose that f is an automorphism of G. Let x be an arbitrary element of G. 
Since f is onto, it follows that there is an element y € G such that f(y) = x. On 
the other hand, since y € (a), it follows that y = a”, for some integer n. So, we have 
x= f(y) = f(a") = f(a)”. This shows that G = (f(a)). 

Conversely, assume that G = ( f (a)). We must show that f is bijective. Let y € G 
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be an arbitrary element. Then, y = f(a)*, for some integer k. Hence, we obtain 
y = f(a‘), and so we conclude that f is onto. To prove that f is one to one, we 
consider the following two cases: 

Case 1: Let G be finite. Then, f is clearly one to one. 

Case 2: Let G be infinite, and suppose that f(x) = f(y). Then, f(a‘) = f(a’), 
where a! = x anda/ = y, for some integers i and j. So, we can write f(a)! = f(a), 
or equivalently f(a)'~/ = e. Since G is infinite, it follows that i = j, and so x = y. 
Consequently, f is one to one. | 


Theorem 11.50 /f G is an infinite group, then Aut(G) is of order 2. 


Proof The result follows from Theorem 11.49 and the fact that the number of gen- 
erators of an infinite cyclic group is 2. | 


Theorem 11.51 /f G = (a) is a finite cyclic group of order n, then Aut(G) = Uy. 


Proof Since G is finite, by Corollary 4.33, the number of generators of G is g(n). 
This yields that |Aut(G)| = y(n). Suppose that f : G — G be an automorphism 
of G. Then, we have f(a) = a*, for some integer k. Since f(a) is a generator 
of G, it follows that k <n and (k,n) = 1. Now, we define 0: Aut(G) > U,, by 
(fo = k, for all f € Aut(G). Let f and g be two elements of Aut(G) such that 
f@= a* and g(a) = a', for some integers k and /. So, we have (f o g)(a) = 
f(a’) = a" Therefore, we obtain O(f og) = kl=kl= 0(f)@(g). This shows that 
6 is a homomorphism. Now, if 6(f) = 0(g), then k = /. This implies that n|k — 1. 
Since k <n,l <n, (k,n) = land (/,n) = 1, weconclude thatk = 1, andso f = g. 
This proves that f is one to one. Finally, since |Aut(G)| = |U,,| = g(n), it follows 
that 6 is onto. Therefore, 0 is an isomorphism. | 


Theorem 11.52 Jf G is a group of order p", then |Aut(G)| = p"(p — 1). 


Proof It follows directly from Theorem 11.51. | 


Exercises 


1. Find all of the automorphisms of Ze. 

2. Prove that a +ib++ a — ibis an automorphism of C%*. 

3. Prove that Aut(S3) = S3. 

4. What are the inner automorphisms of the quaternion group Qg? Is Inn(G) = 
Aut (G) in this case? 

5. Prove that Aut(D4) = D4 and yet Dy has outer automorphism. 

6. Prove that Aut(As) = Ss. 

7. Let G be any group. Prove that the function f from G to itself defined by 
f(a) = a7", for all a € G, is an automorphism if and only if G is abelian. 
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. If Gis a group such that Aut(G) is the trivial group, prove that G is abelian and 


every element of G is of order 2. 


. If G = H, prove that Aut(G) = Aut(A) and Inn(G) = Inn(A). 
. Find two non-isomorphic groups G and H such that Aut(G) = Aut(A). 
. Prove that every finite group having more than two elements has a non-trivial 


automorphism. 


. If an automorphism fixes more than half of the elements of a finite group, prove 


that it is the identity automorphism. 


. Let G be a group and f : G > G defined by f(x) = x”, for all x € G, be an 


automorphism. Prove that for each a € G, we have a’-!€ Z(G). 


. Prove that if G is a group in which every non-identity element is of order 2, then 


G has a non-trivial automorphism. 


. Let G be the group of order 9 generated by elements a and b, where a* = b> = e. 


Find all the automorphisms of G. 


. Prove that the group of inner automorphisms of the symmetric group S, (n > 3) 


is isomorphic to Sy. 


. Prove that every automorphism of S4 is inner. What is the order of the group of 


automorphisms of $4? 


. Let G be the additive group of numbers of the form mp”, where m and n are 


integers and p is a fixed prime. Describe End(G) and Aut(G). 

Prove that if G is a finite non-cyclic abelian group, then Aut(G) is not abelian. 
Let f be an automorphism of a finite group G with the property that f(x) = x if 
and only if x = e. Suppose further that f? = id, the identity element of Aut(G). 
Show that G is abelian. 

Hint: First prove that every a € G can be represented as x~! f(x), for some 
xeG. 


11.5 Characteristic Subgroups 


Recall that a subgroup of a group G is normal if it is invariant under conjugation. 
Now, conjugation is just a special case of an automorphism of G. 


Definition 11.53 Let G be a group and H be a subgroup of G. We say that H is a 
characteristic subgroup of G, if for every automorphism f of G, f(H) = H. 


Example 11.54 Let G be a group. Then, G itself and the identity subgroup {e} are 
characteristic subgroups of G. 


Example 11.55 The center of any group is a characteristic subgroup. Indeed, let 
f € Aut(G) be arbitrary. Pick z € Z(G), then z commutes with every element of G. 
Pick an element a € G. Since f is onto, it follows that a = f(b), for some b € G. 
Hence, we can write 


af(z) = FOF) = fOD = feb) = fOFO = f@a. 
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Since a is arbitrary, it follows that f(z) commutes with every element of G, and so 
f() € Z(G). Consequently, we get f(Z(@) Cc Z(G). Applying the same result 
to the inverse of f, we obtain f~'(Z(G)) C Z(G). This implies that Z(G) C 
f (Z(G)). Therefore, we have f (Z(G) = Z(G), and this shows that Z(G) is a 
characteristic subgroup of G. 


Example 11.56 Derived subgroup G’ of a group G is a characteristic subgroup of 
G, since for every f € AUt(G) anda, b € G, we have f([a, b]) =[f(@, f(b)]. 


Example 11.57 If H is the only subgroup of G of order m, then it must be char- 
acteristic, because f(H), for all f € Aut(G), is again a subgroup of G of order 
m, 


Note that f restricted to H a characteristic subgroup (denoted by f|;,) is an 
automorphism of H (it is an endomorphism by definition of H being characteristic). 
Here are a few immediate properties of characteristic subgroups. 


Theorem 11.58 Let G be a group, and let H, K be subgroups of G. 


(1) If H is characteristic in K and K is characteristic in G, then H is characteristic 
in G (being characteristic is transitive); 
(2) If H is characteristic in K, and K is normal in G, then H is normal in G. 


Proof (1) Note that by assumption H < K <G. Let f be an automorphism of 
G. Since K is characteristic in G, it follows that f(K) = K by definition, and 
so f|x is an automorphism of K. Now, since H is characteristic in K, it follows 
that f|x(H) = H. But f|x is just the restriction of f (recall H < K), and hence 
f(D) =H. 

(2) Consider the automorphism of K given by k > aka~!,a € G, which is well 
defined since K is normal in G. For any choice of a, we get a different automorphism 
of K, which will always preserve H, because H is characteristic in K . Consequently, 
aHa™! C H. This shows that H is normal in G. | 


Theorem 11.59 Let G be a finite group and N be a normal subgroup of G. If 
(\N|,[G: N]) = 1, then N is a characteristic subgroup of G. 


Proof Suppose that |N| = m,[G : N] =n and (m,n) = 1. Then, we conclude that 
|G| = mn. Let f € Aut(G) be an arbitrary automorphism. Since N is a normal 
subgroup of G, it follows that Nf (4) is a subgroup of G. Moreover, we have 


pape UE. 
INA fF(N)| 

Assume that k = |N 1M f(N)|. Since |N| = | f(V)| = m, it follows that |Nf(N)| = 
m?/k. Since Nf(N) < G, it follows that INF (N)|IGI, or equivalently m?/k|mn. 
Since (m,n) = 1, it follows that m = k. Hence, we obtain |N M f(N)| = |N|. This 
yields that NN f(N) = N,andsoN C f(N). Finally, we conclude that N = f(N), 
and this completes the proof. | 
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Exercises 


1. Prove that every subgroup of a cyclic group is characteristic. 

2. Give an example of a group G containing a normal subgroup that is not a charac- 
teristic subgroup. 

Hint: Let G be abelian. 

3. If H is a characteristic subgroup of a group G, prove that Cg(/7) is also a char- 
acteristic subgroup of G. 

4. Let G be a finitely generated group and let H be a subgroup of G with finite 
index. Show that there is a subgroup K of H that is characteristic in G and has 
finite index in G. 

5. Let G be a group and H, K be subgroups of G. If H is a characteristic subgroup 
of G and H C K CG, prove that K/H is a characteristic subgroup of G/H 
implies that K is a characteristic subgroup of G. 


11.6 Another View of Linear Groups 


Let V be a finite dimensional vector space over a field F. The set of all invertible 
transformations of V to V is denoted by GL(V, F). This set has a group structure 
under composition of transformations. On the other hand, in Sect. 7.1, we applied the 
notation GL,, (F) for the general linear group, the group of n x n invertible matrices. 


Theorem 11.60 Let V be a finite vector space of dimension n over a field F with 
ordered basis {v,,..., U,}. Then the map 0: GL(V,F) > GL, (F) corresponding 
to this basis is an isomorphism of groups. 


Proof The result immediately follows by the properties of linear transformation. Hl 


Hence, since GL(V, F) = GL, (F), we can consider the elements of general linear 
group as invertible matrices as well as invertible linear operators on V. The special 
linear group SL, (F) is the set of all matrices A € GL,,(F) such that det(A) = 1. We 
note that if V is a finite vector space of dimension n over a field F with ordered basis 
{v,,..., Un}, andif 6 : GL(V, F) > GL, (EF) is as above, then the set 


{T € GL(V,F) | det (0(T)) = 1} 


is independent of the choice of the basis. This subset is the subgroup 07! (Ss L(n, F)) ; 
and we denote it by SL(V, F). It is clear that SL(V, F) = SL, (EF). 


Theorem 11.61 Jf F is a finite field with q elements, then 


(3) 


ISEn(F)|=¢°~" | [@' =. 
i=2 
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Proof Let F* be the multiplicative group of F. We define f : GL,(F) > F* by 
f(A) = det(A), for all A € GL, (F). It is easy to check that f is an onto homo- 
morphism. Moreover, we see that Kerf = SL,(F). So, by the first isomorphism 
theorem, we conclude that 

GL, (F) 


SL,(F) 


IIe 
= 


This implies that |SL,(F)| = |GL,(F)| - |F*|. Now, by Theorem 7.32, the result fol- 
lows. | 


Definition 11.62 (Projective Linear Groups) Let F be a field and n a positive integer. 
The set Z = {al,, | a € F*} of scalar matrices is a normal subgroup of GL, (F). The 
projective general linear group over F is defined by 


GL, (F) 


PGL,() = 7 


’ 


and the projective special linear group over F is defined by 


SLAP) ~ SL,®)Z 


PSL,(F) = a 
eink) SL(F) NZ Z 


The projective groups are obtained from the corresponding ordinary linear groups 
by identifying matrices that are scalar multiples of each other. 


An automorphism of a field F is a one to one function o from F onto itself such 
that o(a + b) = o(a) + o(b) and o (ab) = o(a)o(b), for all a, b € F. Let Aut (F) 
be the set of all automorphism of F. We denote o (a) := a’. 


Lemma 11.63 /fF isa field, then Aut (F) is a group under composition of functions. 
Proof It is straightforward. | 


Definition 11.64 Let V bea finite dimensional vector space on a field F. A function 
T : V = V is called a semi-linear transformation on V with associated field auto- 
morphism o € Aut(F), if T (vy + v2) = T(v,) + T (v2) and T (cv,) = c° T (vy), for 
allvj,v2 eV andceF. 


Example 11.65 Let V = C” be the vector space of dimension n over C, and 
o be an automorphism of C defined by (a+ bi)” =a — bi, for alla+bi €C. 
Then, T: V > V with T(aq, + Dil, ..., 4, + bi) = (4, — Dyi, ..., Qn — dni), for 
alla, + bji,...d, + b,i € C, is a semi-linear transformation. 


Let V be a vector space on a field F. We denote L(V, F) the set of all invertible 
semi-linear transformations. 


Theorem 11.66 ['L(V, F) is a group under composition of semi-linear transforma- 
tions. 
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Proof It is straightforward. | 
The group 'L(V, F) is called the semi-linear group. Also, the group 


rL(V, F) 


is called the projective semi-linear group. If dimV =n, then we write CL, (F) and 
PIL, (BP), respectively. 


Corollary 11.67 GL(V, F) is anormal subgroup of L(V, F) and we have 


L(V, F) 


——_—— = Aut (F). 
GL(V,F wm) 


YH 


Proof We define 6 : TL(V, F) > Aut(F) by 6(T) =o, where o is the associated 
field automorphism of T. It is clear that 6 is an onto homomorphism with Ker@ = 
GL(V, F). This completes the proof. a 


Exercises 


1. Prove that 
(a) PSL,(F) I PGL,(F) 3 PVL,@); 
PTL, (F) 
b) ——— = Aut (F). 
0) 5 GL, (F) ut(F) 
2. Let F, be a finite field with q elements. Prove that 


(a) PSL2(F2) = SLo(F2) = GL2(F2) = $3; 
(b) PSL2(F3) = Ag. 


3. Show that the set of all matrices of the form 


ie th 


is a group isomorphic to D,,, where all entries in the matrix are in Z,. 
4. Let C© be the complex plane augmented by an extra point oo. Leta, b,c,d € C 
be such that ad — bc £ 0. Define f : C* > C™ by 
az+b., d 
f@= if z #0, z#-— 
Fi cz +d c 
if c £0, then f(- ) hie 
C 
a 
c 
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and by ah 
az 


f@= 
a 

f(co) = = 
Show that under composition these functions form a group G and G is a homo- 
morphic image of GL2(C). 

5. Give an example of an outer automorphism of a group G which maps each con- 
jugate class of G onto itself. 
Hint: Take 


ifz~Aw, 
if c= 0, then e 


loa 
= {| 4 | | a,b € Zs and (b, 8) = 1}. 


Show that G is a group under matrix multiplication. Define f : G > G by 


b*—1 


Hee al 2 


0 b 


11.7 Worked-Out Problems 


Problem 11.68 Let G and H be finite groups such that (|G|, |H|) = 1, prove that 
the only homomorphism from G to H is the zero homomorphism. 


Solution Suppose that f is a homomorphism from G to H. By the first isomor- 
phism theorem, we can write G/Kerf = Imf. Since Kerf < G, it follows that 
|Kerf |||G|. Consequently, we get (|G/Kerf|, |H|) = 1. Analogously, since Imf < 
H, it follows that |Imf ||| H|. Hence, we conclude that |G/Kerf| = |Imf| = 1. This 
yields thatG = Kerf .Therefore, f sends every element of G to the identity element 
of H, and this means that f is the zero homomorphism. | 


Problem 11.69 If G is a perfect group, prove that the center of G/Z(G) is trivial. 


Solution For convenience we set H = Z(G). Assume that the center of G/H is 
non-trivial. This means that there is an element aH € Z (G/ H ) such that a ¢ H. 
Since aH € Z(G/H), it follows that aHxH = xHaH, for all xH € G/H. This 
implies that axH = xaH, or equivalently a~'!x~!ax € H. Now, we can define a 
function f : G—> H by f(x) =a~'x~!ax, for all x € G. We show that f is a 
homomorphism. Let x and y be elements of G. Then, we find that 

a (xy)“!a(xy) = a7 y7!x7laxy 
= (a! y"!ay)y"!(a7!x~1ax)y 

= (a'x7!ax)(a!y“!ay) = f(x) fQ). 


f(xy) 


Thus, f isa homomorphism. By the first isomorphism theorem, we have G/Kerf = 
Imf < H. As H is abelian, we conclude that G/Kerf is abelian. Consequently, 
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by Theorem 10.4, we obtain G’ C Kerf. Since G is perfect, we have G = G’, 
which implies that G = Kerf. Now, if G = Kerf, then f(x) =e, for all x € G. 
So, we obtain a~!x~!ax =e or ax = xa. This shows that a € H, which is a 
contradiction. Therefore, we conclude that aH = H or a H. This proves that 


Z(G/H) ={H}. a 
Problem 11.70 Suppose that 


(1) G and H are groups and G is finite; 
(2) f :G— AH isa homomorphism; 
(3) A and B are subgroups of G such that G = AB and (\A], |B) = 1. 


Prove that for each normal subgroup N of G, N = (NM A)(N NB). 


Solution Let m : G — G/N be the canonical map. Since z is onto, it follows that 
m(G) = G/N. Then, we have 


mu(G) = 27(AB) = 7(A)x(B) = G/N, 


m(A) = {1(a) | a € A} = {aN |a € A} = (AN)/N, 
m(B) = {1(b) |b € B} = {bN |b € B} =(BN)/N. 


eel ee: 


On the other hand, by the second isomorphism theorem, we have 


Thus, we conclude that 


A AN B . BN 
= and = . 
ANN N BON N 


So, we deduce that A 
N BN 
IF lfiAl ana ||| 


Since (|A|, |B|) = 1, it follows that (|(AN)/N|, |(BN)/N|) = 1. Hence, we obtain 


AN _ BN 
“nN — ={N} and AN B = {e}. 
N N 


Now, we can write |G| = |AB| = |A| |B| and 


[S| = |2%) )F"| |A| |B| |G| 


NI | NITN I JANNIIBAN| [ANNIBOANI 


This implies that 
IN| =|ANN||BNN. (11.2) 


Moreover, since AN N < N and BON < N, we have 


11.7. Worked-Out Problems 275 
(AN N)(BON) <N. (11.3) 


Now, by (11.2) and (11.3), we conclude that VN = (AN N)(BON). a 


Problem 11.71 Prove that there are only two groups of order 6, one is cyclic and 
another is isomorphic to S3. 


Solution Suppose that G is a group of order 6. Since there exist 5 non-identity 
elements and x <> x7! is a one to one correspondence between the elements of G 
and their inverses, it follows that there exists a non-identity element a € G such that 
o(a) = 2. 

Let G be abelian and H = (a). Then, we have |G/H| = 3, which implies that 
G/H is cyclic. Assume that G/H = (bH), for some b € G. Since o(bH)|o(b), it 
follows that o(b) = 3 or 6. If o(b) = 6, then G is cyclic. If o(b) = 3, then since 
ab = ba and (0(a), o(b)) = |, it follows that o(ab) = 6. Hence, again in this case 
we deduce that G is a cyclic group generated by ab. 

Now, assume that G is non-abelian. Then, all non-identity elements of G can 
not be of order 2. Thus, there exists c € G such that o(c) = 3 and ac # ca. Take 
N = (c). Since [G : N] = 2, it follows that N is a normal subgroup of G. Hence, 
we get aca~! € N. This yields that aca~! = c or aca~! = c?. Since ac # ca, we 
conclude that aca~! = c?, or ca = ac?. On the other hand, we know that a can 
not be in N, so we can write G = N UaN = {e, c, c*, a, ac, ac’}. Now, it is 
a routine verification to check that the function f : G — $3 given by f(e) = id, 
f(a) = (12), fc) = (123), f(c?) = (132), Flac) = (2 3) and f (ac?) = (13) 


is an isomorphism. This completes the proof. a 
Problem 11.72 Prove that any non-abelian group of order 8 is either D4 or Qs. 


Solution Suppose that G is a non-abelian group of order 8. Since G is non-abelian, 
it follows that G is not cyclic. So, G don’t have any element of order 8. If every 
non-identity element of G is of order 2, then again G must be abelian. Hence, there 
is a non-identity element a € G such that the order of a is neither 8 nor 2. Since 
o(a)||GI, it follows that o(a) = 4. If N = (a), then [G : N] = 2, and so N IG. 
Take b € G such that b ¢ N. Then, we obtain 


G=NUDN ={e, a, a’, a’, b, ba, ba’, ba’*}. 


This yields that G = (a, b). Since G is non-abelian, it follows that ab 4 ba. Since 
N <1G, it follows that bab~! € N. Moreover, o(bab~! = o(a) = 4. In the subgroup 
N, the only element of order 4 other than a is a®, so we conclude that bab~! = a3 = 
a_'. Now, G/N is of order 2, so (bN) = N, or equivalently bee {e, a, a, a}. 
If b? = a or b? = a’, then o(b) = 8. This is impossible, because G is non-abelian. 
Thus, we conclude that b? = e or b* = a’. If b* = e, then G is Dg (see Exercise 30 


in Sect. 11.2). If b? = a, then G is the quaternion group. | 


Problem 11.73 Let G be a group and H be a subgroup of G. Prove that Ng(H)/ 
Cg(A) is isomorphic to a subgroup of Aut(#7). 


276 11 Group Homomorphisms 


Solution Define 6: Ng(H) > Aut(A) by 6(a) = 4, for all a € NG(A), in which 
6,(h) = aha™', for allh € H. Since a € Ng(H), it follows that aHa~! = H. This 
shows that 6,(h) € H. Moreover, we observe that 0, is a bijective homomorphism 
from H onto itself, and so 6, € Aut(H). Now, itis easy to check that 6 is ahomomor- 
phism with Ker@ = Cg(#). At the end, the result follows by the first isomorphism 
theorem. a 


Problem 11.74 Let G bea finite group and suppose that automorphism f sends more 
than three-quatres of the elements of G onto their inverses. Prove that f(x) = x~!, 


for all x € G and that G is abelian. 


Solution Set A= {a€G| f(a) =a"!}. If x € A, then |AUxA| = JA] + |xA] — 
|ANxA|. Since AUxA CG, it follows that |A UxA]| < |G]. Since the function 
h: A-— xA defined by f(a) = xa, for all a € A, is a one to one correspondence, 
it follows that |G| > |AUxA|=|A|+|A|—|AMxA|. This implies that |G| > 
3|G|/4+ 3|G|/4 —|ANxA| or |ANxA| > |G|/2. Now, assume that xa; and xa 
are two elements of AN xA, then f(xa;) = f(x) f(a) = Kgs? and f(xa;) = 
(xa;)~! = a; 'x~!. Hence, we conclude that a7'x—! = x~!a;7', which implies that 
x and a; commute. Since there are more than |G|/2 elements in A 1 xA, it follows 
that |Cg(x)| > |G|/2. Now, by Lagrange’s theorem, we must have G = Cg(x), and 
this yields that x € Z(G). Since this is true for all x € A, it follows that A C Z(G). 
Thus, we obtain 3|G|/4 < |A| < |Z(G)|. This shows that G = Z(G), because Z(G) 
is a subgroup of G. Consequently, G is abelian. So, A is a subgroup of G such that 
3|G|/4 < |A|. By Lagrange’s theorem we deduce that G = A. | 


Problem 11.75 Prove that an automorphism of S, which sends transpositions to 
transpositions is an inner automorphism. 


Solution Suppose that f : S, — S, be an automorphism mapping transpositions to 
transpositions. For two distinct transpositions we can consider two cases: 


(1) (a b)(c d), which is of order 2; 
(2) (a b)(ac) = (abc), which is of order 3. 


Therefore, we can say that f maps any pair of disjoint transpositions to a pair of 
disjoint transpositions. Let f (( 2)) = (r s) and suppose that 3 < x <n. Since 
(1 2)(1 x) is a cycle of length 3, it follows that 


f(A. 2)0 x)) = f (C1 2)) F(x) = 5) F(x). 


Hence, f (dl x)) moves either r or s. Without loss of generality, we may suppose 
that it moves r. Consequently, we can write f (d x)) = (r t), for some ¢ different 
from r and s. 

Now, we claim that foreach y 4 1, we have f(a y)) = (ru),forsomel <u <n 
and different from r. 

If y = 2 or y = x, the claim is true. 

If y A 2 and y ¥ x, then both permutations (1 y)(1 2) and (1 y)(1 x) are cycles 
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of length 3, so are their images under f. As f((12)) = (rs) and f((1 x)) =(r 0), 
if f(a y)) is not moving r, then it must move both s and ¢. Since f(d y)) isa 
transposition, it follows that f(C y)) = (st). It is easy to see that (7 s)(r t)(s t) = 
(r t), and so f~'((r s)) f-'((r t)) f-'((s t)) = f 7! ((r 1). Consequently, we get 
(12) x)d y) = (1 x). Since y ¢ {1, 2, x} and x ¥ y, we obtain a contradiction. 
Therefore, we conclude that f(a y)) must move r, and hence f(a y)) = (ru), for 
some 1 < u <n and different from r. 

Now, we define o € S, as follows: 


eee r ify=1 
y= Vu ify #1, 


where u is the unique element for which f (d y)) = (r u). Also, assume that g : 
S, —> S, denote the conjugation by o. For every y we obtain 


e (fy) =e 6) HeC wa = (19). 


This means that g~!f fixes all permutations of the form (1 y), for all y. Since 
(a b) = (1b)(1a)(1 D), for all a and D, it follows that g~' f fixes every permutation 
of S,. Thus, we obtain f = g. This completes the proof. | 


Problem 11.76 A group G is complete if it is centerless and every automorphism 
of G is inner. Ifn ¢ 2 andn # 6, prove that S, is complete. 


Solution Let C(k) denote the the conjugate class in S, consisting of all products 
of k disjoint transpositions. We know that a permutation in S, is of order 2 if and 
only if it belongs in some C(k). So, if f € Aut(S,,), then f(C(D) = (C(&), for 
some positive integer k. We claim that if n 4 6, then |C(1)| 4 |C(k)|, for k A 1. 
Assuming this, then f(C()) = C(1), and Problem 11.76 completes the proof. 

Clearly, we have C(1) = n(n — 1)/2. In order to count the elements of C(k), we 
observe that there exist 


n(n—1) (n—2)(n—3) (n — 2k — 2)(n — 2k +1) 
—- ; —— 5 


k-tuples of disjoint transpositions. Since disjoint transpositions commute and there 
are k! orderings obtained from any k-tuples, we obtain 


n(n —1)(n—2)...(n —2k +1) 
k12k 


IC(K)| = 


Now, the question whether |C(1)| = |C(k)| leads to the question whether there is 
some k > | such that 


(n —2)(n —3)...(n —2k +1) = k!2*. (11.4) 


Since the right side of (11.4) is positive, it follows that n > 2k. Hence, we have 
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@= DG =3) cn DES) S ORS DOR = 9)... Oh = 2D) 
= (2k —2)!. 


Moreover, by mathematical induction, we find thatifk < 4, then (2k — 2)! > ktak-!, 
Consequently, (11.4) is true if k = 2 ork = 3. If k = 2, then the right side of (11.4) 
is 4 and it is easy to see that equality never holds. So, we suppose that k = 3. Since 
n > 2k, it follows that n > 6. If n > 6, then the left side of (11.4) is greater than or 
equal to 5-4-3-2 = 120, while the right side of (11.4) is 24. Therefore, we have 
proved that ifn ¢ 6, then |C(1)| 4 |C(k)|, for all k > 1, as desired. | 


Remark 11.77 S$; = Z» is not complete because it has a center. We shall see in 
future that S¢ is not complete too. 


Problem 11.78 Let H be a subgroup of a group G < GL, (C). If the set 


{[x, y] |x € Gand ye H} 


is a set of order m, prove that [G : Cg(H)] < m" 


Solution If y € H, then y has at most m conjugate x~!yx = y[x, y]~! 


x € G). Therefore, we have 


in G (where 
IG: Cg] <m. 

Suppose that L(H) = {cjhy +--+ +Cmhm | ci € C and h; € H}. If we define addi- 

tion and multiplication by scalars in an obvious manner for elements in L( #7), then 

L(A) is a vector space over C. The set of all n x n matrices with entries in C 

is a vector space over C containing L(H) as a subspace. Since the former space 

has dimension n?, it follows that the dimension L(H) is at most n?. Assume that 


dimL(H)=k< n*. Then, we can find elements y1,---, Yk in A which form a basis 
for L( 1). Consequently, we have 


k 
Cg(B) = () CeGi). 
i=l 
So, we have 
k k , 
IG 3 Cath] =(6 2 [| CeG)1S | [Ie >CeQi)] <m". 
i=l ray 


This completes the proof. a 
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11.8 Supplementary Exercises 


3: 


10. 


11. 


12. 


. Let G be a group, and let S be any set for which there exists a one to one 


and onto function f : G— S. We define a binary operation on S by setting 
xy=f (f@)f"'O)), for all x, y € S. Prove that S is a group under this 
operation, and that f is actually a group isomorphism. 


. Let G be a group of order 12. Show that either G has a normal subgroup of order 


3 or G is isomorphic to Aq. 


. Let f : Gj — G» be an onto homomorphism. Let H; be a normal subgroup of 


G, and suppose that f(H,) = Hp». Prove or disprove that G,/H, = G2/Ap. 


. Let N; and N, be two normal subgroups of G. Prove or disprove that 


(a) Nj = No implies G/N; = G/Nao; 
(b) G/N, = G/N2 implies Ny = No. 


Let H be the subgroup of GL2(Z;) as follows: 


a={|5 ‘l € GL2(Z3) | a,b € Z3, a #0}. 


Show that H is isomorphic to the symmetric group $3. 


. Show that the subgroup fid, (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)} of Ag is 


isomorphic to the group of plane symmetries of a chessboard. 


. Show that to each positive integer there exist only a finite number of pairwise 


non-isomorphic groups of order n. 


. Prove that distinct automorphisms of the symmetric group S4 induce distinct 


automorphisms of the alternating subgroup Ay. What is the order of the group 
of automorphisms of A4? 


. Let G and H be groups, f : G — H be ahomomorphism with kernel NV, and K 


be a subgroup of G. Prove that f~'(f(K)) = KN. Hence, f~!(f(K)) = K if 
and only if N < K. 

If G is metabelian and f : G — H is a homomorphism, prove that f(G) is 
metabelian. 

Prove that 


(a) If G is anon-abelian group, Aut(G) can not be cyclic; 
(b) There is no finite group G for which Aut(G) is cyclic of odd order greater 
than 1. 


Show that if G is a group with trivial center (Z(G) = {e}), then its group of 
automorphisms, Aut(G), is also a group with trivial center. 

Hint: Let f € Z(Aut(G)). For any a € G, let gg € Inn(G). Then f o¢, = 
da o f (Why?). Use this to show that for any x € G, a! f(a) € Ce(f()). 
Infer the result from this. 


. Suppose that Z(G) = {e} and Inn(G) is a characteristic subgroup of Aut(G). 


Show that any automorphism of A is inner. 
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14. Let G be a group and H a normal subgroup of G with the property that [G : 


15. 


16. 
17. 


18. 
19. 


20. 


21. 


— 


22. 
23; 


24. 


H|=4. Let K be an arbitrary subgroup of G. Show that if the factor group 
K/K ( G is non-trivial, then it is isomorphic to either Z, or Z4, or K4. 

Let H be a subgroup of G. Prove that the factor group Ng(H)/Cg(A) is iso- 
morphic to a subgroup of Aut(#). 

Prove that if G = H, then G’ = H’ and Z(G) = Z(A). 

Suppose that G; and Gz are finite perfect groups such that Gj/Z(G1) = 
G2/Z(G2). Prove that there exists a finite perfect group G and subgroups 
H,, Hy < Z(G) with G/Z(G) = G;/Z(G;) and G/H; = G;, fori = 1, 2. 
Show that no group can have its automorphism group cyclic of odd order. 

Let G be a finite group. If k(G) is the number of conjugate classes of G, show 
that 


(a) k(G) = 2 if and only if G = Z); 
(b) k(G) = 3 if and only if G = Z; or G = 83; 
(c) What can you say about finite groups with 4 or 5 conjugate classes? 


Let p be a prime and Z(p™) be the following subset of the group Q/Z: 
Z(p®) = {a/b € Q/Z| a,b € Zand b = p’' for some i > 0}, 


where a/b = a/b + Z. Show that 


(a) Z(p®) is an infinite group under the addition operation of Q/Z; 

(b) Every element of Z(p®) has finite order p” for some n > 0; 

(c) If atleast one element of a subgroup H of Z(p®) has order p*, and no element 
of H has order greater than p*, then H is acyclic subgroup generated by 1/p*, 
whence H = Z,x; 

(d) The only proper subgroups of Z(p%) are the cyclic groups (1/p”) (n 
152,522; 

(e) If x1, x2,... are elements of an abelian group G such that o(x,) = p, px. = 
X1, PxX3 = Xo,..., PXn41 = Xp, --., then the subgroup generated by x;’s (i > 
1) is isomorphic to Z(p™); 

(f) If H is a proper subgroup of Z(p®), then Z(p~)/H = Z(p®). 


If G has order n > 1, prove that 


k 
|Aur(G)| < | [@—25, 


i=0 


where k = [log,(n — 1)]. 

Find a finite group G with anormal subgroup N such that |Aut(N)| > |Aut(G)|. 
Let G be a group of order pq, where p and q are distinct prime numbers, and 
if G has a normal subgroup of order p and a normal subgroup of order qg, prove 
that G is cyclic. 

Let G be a group of order pg, where p > q are primes. Prove that 


11.8 Supplementary Exercises 281 


(a) G has a subgroup of order p and a subgroup of order gq; 
(b) Ifg Jp — 1, then G is cyclic; 
(c) Given two primes p and q with g|p — 1, there exists a non-abelian group of 


order pq; 
(d) Any two non-abelian groups of order pq are isomorphic. 
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Torsion-free group, 94 

Torsion subgroup, 98 

Totally ordered set, 11 

Total order relation, 11 

Trace of a square matrix, 164 
Transformation of the plane, 31 
Translation, 34 

Translation group, 180 
Transpose of a matrix, 163 
Transposition, 113 

Trivial subgroups, 76 
Two-sided inverse, 57 
Two-sided zero element, 56 
Two-step subgroup test, 76 


U 

Union, 2 

Union of a family of sets, 4 
Unit, 89 

Unit vector, 167 


Index 


Upper bound, 12 
Upper triangular, 153 


Vv 

Vandermonde matrix, 193 
Vector, 166 

Vector projection, 168 
Vectors in R”, 166 


Ww 

Well ordering principle, 18 
Whole numbers, 2 
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